Ref #

Hits

Search Query

DBs

Default Operator

Plurals

Time Stamp






L1


294 


Yranriniihi^= with =^arnhl*fe^ Van ■^^'^■-■^^■^ 




..^fX... ...... ....... 


:-;V./l i*:: : ■ 








f(aWrl<"2hni 0702"^ and r'7h7'7<fe 


:::U jr:M:l:y:;:;::::;:; 










■ ; ■ ■ ; , ■ ■ ; 








■■*■■■;;■;■•;*■;;•-;■ 


ill'::';:;:!"!:! • r- '. :! 


L2 


25 


L1 and (database with partition)


US-PGPUB; 


OR 


ON 


2004/12/16 13:35 








USPAT; 














IBM_TDB








: : : LJ. r : 


b 

v.. 


■^1^1 -and /^PYtrahnlat^^^ with Tphllr*fe 
:with hartitinnV'- 

. . VV. 1 Ll 1 . yJsI * • W Ul 1 y; — 


i^iiijCipGpijRVi; 

::;W*J::r:VJr.VU/ :; 
USPAT;


\ r\D :;• . 

■.::\J.f\-::::::::::::::: 


:::v/.i?l::::::::::: 


2004/12/16 l i-^6^ ' 


1 4 


n 

u 




IBM_TDB


OR 


ON 








dptprmin43^ with reolictQ with 


USPAT;












partition)


IBM_TDB 












■•11 ■ and YrP nil r4;Q- with ns^^^ 


::\J 0..fr:\3r. KJO f 


cik 

:::wr\::::::::;::::::: 


::;Vj/.IN::::-:::::: 


2004^2/16 ■t^'^7^ 






















IBM_TDB


:::::::::::::::::::::::::: 








1 
1 






OR 


ON 


2004/07/06 11:17








USPAT;














IBM_TDB








S2


6


("5511190".pn. or "5675786".pn.


US-PGPUB;




ON


2004/09/29 19:17






or "5970495".pn. or "6049861".


USPAT;












pn. or "6092085".pn. or


:::XUI,,lj_^:l :: r:: , 


























"6128611".pn.)












7 

/ 


((database or record$2 or file$2)


US-PGPUB;


OR




'5on'^/n7/'?fi ii;-n7 

ZUUj/U//ZD ID.U/ 






with random with sampling) and


USPAT;












and (@ad<"20010702") 


IBM_TDB 












((database or record$2 or file$2)


US-PGPUB;


::::::: 


ON 


2004/01/15 08:30






with random with sampling) and


USPAT;






::■:!:■:'■:; :::;:::::.•:::•:■:;.::■!:::.;::: 






(@ad<"20010702")


IBM_TDB
((random with sampling) same


US-PGPUB; 


OR 


ON 


2004/09/30 09:29 






(replicat$3 or reproduc$3 or (re 


USPAT; 












adj produc$3)) same (database or


IBM_TDB












file$2 or record$2)) and
























S6


60


(("5511190".pn. or "5675786".pn.




US-PGPUB; 


OR


ON


2003/07/12 13:51






or "5970495".pn. or "6049861". 


USPAT; 








:. ' I'. 




pn. or "6092085".pn. or


IBM_TDB 












"6128611".pn.) or ((random with


■■: ::• .• ' . 












sampling with (database or file$2


\ 'y ■...:'■ • ■ 












or record$2)) and (replicat$3 or














reproduc$3 or (re adj produc$3)))














or (random with sampling with














(database or file$2 or record$2))














or ((replicat$3 or reproduc$3 or














(re adj produc$3)) and (random














with sampling))) and














(@ad<"20010702") and "707"/$.














ccls.


S7


79 

. ................. ..... 


((random with sampling) same 
(database or file$2 or record$2)) 

and (replicat$3 or reproduc$3 or
regenerat$3 or (re adj produc$3)

or (re adj generat$3)) and 


US-PGPUB; 
USPAT; 
IBM_TDB


OR 


ON 


2003/07/12 17:39 


::J\J::::::: 


::;::::::::;::;;■:::;■;/,::: 


((random with sampling) same


:::U J..r;Vjr\.IDy .■ 


::^^f\.:::;-::::;:;;:: 


:::WIM:::j::::::: 


■ ■ U y j/: y / / : 1^: ; ;±; O > 4.^' ; ■ 






(database or file$2 or record$2))
and (replicat$3 or reproduc$3 or


USPAT;
























regenerat$3 or (re adj produc$3)




























or (re adj generat$3)) and
(@ad<"20010702") and "707"/$.














(random with sampling) and
(database with partition$3) and

(@ad<"20010702") 










S9 


27 


US-PGPUB;

USPAT;

IBM_TDB


OR


ON 


2003/07/12 20:07






("5511190".pn. or "5675786".pn.


US-PGPUB;




ON


:imil(S7;i^ 18:56: 






or "5970495".pn. or "6049861".


USPAT;












pn. or "6092085".pn. or
"6128611".pn.) and (random with
sampl$3)


IBM_TDB




















S11


7 


((replicat$3 or copy$3 or 
reproduc$3 or regenerat$3 or (re 
adj produc$3) or (re adj 
generat$3)) with (part$3 or 
selectiv$5) with database).ab. and 
(replicat$3 with database) and 
(@ad<"20010702") and "707"/$.
ccls. 


US-PGPUB; 

USPAT; 

IBM_TDB


OR 


ON 


2003/08/09 13:19 


S12




(random$3 with sampl$5) and
((replicat$3 or reproduc$3 or (re
adj produc$3) or cop$5) with
database) and (@ad<"20010702")


US-PGPUB;

USPAT;

IBM_TDB




ON


2003/07/26 20:59
















S13 


25 


((analy$5 or comput$3 or 
calculat$3 or determin$3 or 
report$3) with partition$3 with
size) and (random$3 with
sampl$5) and (@ad<"20010702") 


US-PGPUB; 
USPAT; 

IBM_TDB


OR 


ON 


2003/08/09 13:18 


S14 




((replicat$3 or copy$3 or


US-PGPUB;


OR 


ON


2003/08/09 13:53 






reproduc$3 or regenerat$3 or (re
adj produc$3) or (re adj
generat$3)) with partition with


USPAT;
IBM_TDB






















database) and (@ad<"20010702")






















S15 


29 


(random with sampling) and 
(database with (replicat$5 or
cop$5 or partition$3)) and 
(@ad<"20010702") and "707"/$.
ccls. 


US-PGPUB; 

USPAT; 

IBM_TDB


OR 


ON 


2003/08/10 15:59 


S16




"5179699".pn. and (partition
boundar$5)


US-PGPUB;

USPAT;

IBM_TDB




ON


2003/08/10 18:07


(random with sampl$3 with
(database$2 or table$2)) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/01/15 08:33 


S18


^ 4p;: 


(random with sampl$3 with ((build


US-PGPUB;
USPAT;
IBM_TDB






2004/09/29 18:02






adj in) or (built adj in) or
integrat$5 or includ$3 or "have" or
"has" or "having") with
(database$2 or table$2)) and
(@ad<"20010702")








S19 


5 


((random near2 sampl$3) same 
(database$2 near3 manag$9)) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/02/06 10:02 


S20


82


(random near2 sampl$3) and


US-PGPUB;






2004/02/06 10:27






((database or data) with mining)
and (@ad<"20010702")


USPAT; 
IBM_TDB










i. 


((random near2 sampl$3) with
(integrat$3 or part or ((build or 
built) adj in)) with database) and 
(@ad<"20010702") 


US-PGPUB;

USPAT; 
IBM_TDB 


OR


ON








(sampling with database) and

(@ad<"20010702")


US-PGPUB;
USPAT;


OR


ON


2004/02/06 11:06








IBM_TDB








S23 


57 


(sampling same (database near3 
manag$9)) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/02/06 11:07 


S24




(sampling same (database near3
manag$9) same partition$3) and

(@ad<"20010702")


US-PGPUB; 
USPAT;
IBM_TDB


OR




2004/02/06 11:07












S25 


1 


"5710915".pn. 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/02/06 11:15 


S26 




"6253300".pn.


US-PGPUB; 
USPAT;






2004/02/06 11:18








IBM_TDB








S27 


1 


"5950185".pn. 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/02/06 12:13 


S28




"20030004944"


US-PGPUB;
USPAT;


OR


ON


2004/09/29 16:11


1








IBM_TDB








S29 


0 


(random with sampl$3 with ((build 
adj in) or (built adj in) or 
integrat$5 or includ$3 or "have" or 
"has" or "having") with 
database$2 with management) 
and (@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/09/29 18:41 
(random with sampl$3 with ((build
adj in) or (built adj in) or
integrat$5 or includ$3 or "have" or
"has" or "having") with
database$2) and
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/09/29 18:03 


S31 




(sampling with ((build adj in) or


US-PGPUB;


OR


ON


:i 2004/09/2?^ 






(built adj in) or integrat$5 or














includ$3 or incorporat$3 or "have"
or "has" or "having") with

database$2) and
(@ad<"20010702") and

ccls.


EPO; JPO;

DERWENT;
IBM_TDB






























S32 


0 


(((build adj in) or (built adj in) or 
integrat$5 or includ$3 or 
incorporat$3) with database$2 
with (management or system) 
with (less or few$5) with 
((function near2 call) or (input 
near3 output) or "i/o" or "I/O")) 
and (@ad<"20010702") 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB


OR 


ON 


2004/09/29 19:01 


S33




"5511190".pn. or "5675786".pn.
or "5970495".pn. or "6049861".
pn. or "6092085".pn. or
"6128611".pn. or "5179699".pn.
or "5675786".pn. or "6185559".
pn. or "5710915".pn. or
"6253300".pn. or "6542886".pn.


US-PGPUB;

USPAT;

IBM_TDB






i: 2004/09/2^ 19: 


S34 


2 


(random with sampling) and 
((determin$3 or anal$9) with 
database with partition) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/09/30 09:45


S35




(random with sampling) and
((replicat$5 or copy$3) with
database) and (@ad<"20010702")


US-PGPUB;
USPAT;
EPO; JPO;
DERWENT;


OR


ON


2004/09/30 10:15


S36 


3 


(random with sampling) and 
((determin$3 or anal$9) with
database with (partition or size)) 
and (replicat$3 or copy$3) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT; 

IBM_TDB 


OR 


ON 


2004/09/30 09:47 


S37 


13 


(random with sampling) and
((determin$3 or anal$9) with
database with (partition or size or
boundar$7)) and
(@ad<"20010702")


US-PGPUB;

USPAT;
IBM_TDB




ON


2004/09/30 09:59


S38 


37 


(random with sampling) and 
((determin$3 or anal$9) with 
(database or table) with (partition 
or size or boundar$7)) and 
(@ad<"20010702") 


US-PGPUB; 

USPAT;
IBM_TDB 


OR 


ON 


2004/09/30 10:31 
S39


24 


S40 




S41 


7 


S42




S43 


14 


S44


94


S45 


12 


S46


93


S47 


76 


S48


2 



((random with sampling) and 
((determin$3 or anal$9) with 
(database or table) with (partition 
or size or boundar$7)) and 
(@ad<"20010702")) not ((random 
with sampling) and ((determin$3 
or anal$9) with database with 
(partition or size or boundar$7)) 
and (@ad<"20010702")) 

(random with sampling) and
((determin$3 or anal$9) with
(database or table) with (partition
or size or boundar$7)) and
((replicat$3 or copy$5)
with (database or table)) and
(@ad<"20010702")

((determin$3 or anal$9) with 
database with partition) and 
(replicat$3 with database) and 
(@ad<"20010702") 



"5564221".pn.



(random with sampling) and 
((determin$3 or anal$9) with 
(database or table) with (partition 
or size or boundar$7)) and 
(distribut$3 with (database or 
table)) and (@ad<"20010702") 



(random with sampling) and
((replicat$5 or copy$3 or
distribut$3) with database) and
(@ad<"20010702")



((replicat$3 or copy$5) with 
database).ab. and ((determin$3 or 
anal$9) with (database or table) 
with (partition or size or 
boundar$7)) and 
(@ad<"20010702") 



(replicat$3 with database).ab. and
(@ad<"20010702")



(replicat$3 with database).ab. and 
(@ad<"20010702") and "707"/$.
ccls. 

(replicat$3 with database).ab. and
(sampling) and
(@ad<"20010702")



US-PGPUB; 
USPAT; 
IBM TDB 



US-PGPUB;
USPAT;
IBM_TDB



US-PGPUB; 

USPAT; 

IBM_TDB 

US-PGPUB;
USPAT;
EPO; JPO;
DERWENT;
IBM_TDB

US-PGPUB; 
USPAT; 
IBM TDB 



US-PGPUB;
USPAT;
EPO; JPO;
DERWENT;
IBM_TDB

US-PGPUB; 
USPAT; 
IBM TDB 



US-PGPUB; 

USPAT; 

IBM_TDB 

US-PGPUB; 

USPAT; 

IBM_TDB 

US-PGPUB;
USPAT;
IBM_TDB



OR 



OR



OR 



OR



OR 



OR



OR 



OR



OR 



OR 



ON 



ON 



ON 



ON



ON 



ON



ON 



ON 



ON 



2004/09/30 10:00 



2004/09/30 10:16



2004/09/30 10:03 



2004/09/30 10:11



2004/09/30 10:16 



2004/09/30 10:31



2004/09/30 10:34 



2004/09/30 10:34



2004/09/30 10:36 



2004/09/30 10:51
44






S51 


231 


S52





((replicate or replicating) with 
database) and (sampling) and 
(@ad<"20010702") 

((replicate or replicating) with
database).ab. and

(@ad<"20010702")



((replicate or replicating) with 
database) and "707"/$.ccls. and
(@ad<"20010702") 

(((replicate or replicating) with
database) same ((size or partition
or boundar$3) with database))
and "707"/$.ccls. and
(@ad<"20010702")



US-PGPUB; 

USPAT; 

IBM_TDB

US-PGPUB;
USPAT;



US-PGPUB; 

USPAT; 

IBM_TDB

US-PGPUB; 
USPAT;
IBM_TDB



OR 



OR



OR 



OR



ON 



ON 



ON 



ON 



2004/09/30 11:45 



;2q& 


4/0 



















2004/09/30 11:50 



2004/09/30 11:51
1 Advanced tutorials: Software for uniform random number generation: distinguishing the

good and the bad 
Pierre L'Ecuyer 

December 2001 Proceedings of the 33nd conference on Winter simulation 

Full text available: pdf(175.96 KB) Additional Information: full citation , abstract , references , index terms

The requirements, design principles, and statistical testing approaches of uniform random
number generators for simulation are briefly surveyed. An object-oriented random number
package where random number streams can be created at will, and with convenient tools 
for manipulating the streams, is presented. A version of this package is now implemented in 
the Arena and AutoMod simulation tools. We also test some random number generators 
available in popular software environments such ... 

2 Implementing a random number package with splitting facilities
Pierre L'Ecuyer, Serge Cote 



March 1991 ACM Transactions on Mathematical Software (TOMS), Volume 17 Issue 1

Full text available: pdf(794.01 KB) Additional Information: full citation , abstract , references , citings , index terms , review



A portable set of software tools is described for uniform random variates generation. It 
provides for multiple generators running simultaneously, and each generator has its 
sequence of numbers partitioned into many long (disjoint) substreams. Simple procedure 
calls allow the user to make any generator "jump" ahead to the beginning of its next 
substream, back to the beginning of its current substream, or back to the beginning of its 
first substream.... Implementation Issue ... 



Keywords: combined generators, disjoint streams, portability, repeatability 



3 On the Deng-Lin random number generators and related methods
Pierre L'Ecuyer, Renee Touzin

January 2004 Statistics and Computing, Volume 14 Issue 1

Full text available: Publisher Site Additional Information: full citation , abstract , index terms

We study the structure and point out weaknesses of recently proposed random number 
generators based on special types of linear recurrences with small coefficients, which allow 
fast implementations. Our theoretical analysis is complemented by the results of simple
empirical statistical tests that the generators fail decisively. Directions for improvement and 
alternative generators are also pointed out. 
Keywords: efficient generator, lattice structure, multiple recursive generator (MRG),
statistical test of randomness 



4 OCB: A block-cipher mode of operation for efficient authenticated encryption
Phillip Rogaway, Mihir Bellare, John Black

August 2003 ACM Transactions on Information and System Security (TISSEC), Volume 6 
Issue 3 

Full text available: pdf(568.74 KB) Additional Information: full citation , abstract , references , index terms

We describe a parallelizable block-cipher mode of operation that simultaneously provides
privacy and authenticity. OCB encrypts-and-authenticates a nonempty string M ∈
{0, 1}* using ⌈|M|/n⌉ + 2 block-cipher invocations, where n
is the block length of the underlying block cipher. Additional overhead is small. OCB refines
a scheme, IAPM, suggested by Charanjit Jutla. Desirable properties of OCB include the
ability to encrypt a bi ... 

Keywords: AES, authenticity, block-cipher usage, cryptography, encryption, integrity, 
modes of operation, provable security, standards 



5 Modelling the Internet: On characterizing affinity and its impact on network
performance 

Gabriel Lucas, Abhishek Ghose, John Chuang 

August 2003 Proceedings of the ACM SIGCOMM workshop on Models, methods and 
tools for reproducible network research 

Full text available: pdf(236.10 KB) Additional Information: full citation , abstract , references 



An important component of simulation-based network research is the selection of nodes to 
a member group, such as receivers in a multicast group or web clients in a content delivery 
network. In a seminal paper, Philips et al. introduce an algorithm for generating member
groups with different degrees of affinity (clusteredness) and show that affinity can have a 
significant effect on multicast efficiency. Subsequent studies applying this algorithm have all 
used the algorithm's input paramete ... 

6 Storage: Deconstructing storage arrays

Timothy E. Denehy, John Bent, Florentina I. Popovici, Andrea C. Arpaci-Dusseau, Remzi H. 
Arpaci-Dusseau 

October 2004 Proceedings of the 11th international conference on Architectural 
support for programming languages and operating systems 

Full text available: pdf(1.74 MB) Additional Information: full citation , abstract , references , index terms

We introduce Shear, a user-level software tool that characterizes RAID storage arrays. 
Shear employs a set of controlled algorithms combined with statistical techniques to 
automatically determine the important properties of a RAID system, including the number
of disks, chunk size, level of redundancy, and layout scheme. We illustrate the correctness
of Shear by running it upon numerous simulated configurations, and then verify its real- 
world applicability by running Shear on both software-based ... 

Keywords: RAID, storage 



7 Marginally specific alternatives to normal ARMA processes 
Lee S. Dewald, Peter A. W. Lewis, Ed McKenzie 

December 1987 Proceedings of the 19th conference on Winter simulation

Full text available: pdf(261.14 KB) Additional Information: full citation , abstract , references , index terms

In many practical cases in time series analysis, marginal distributions in stationary
situations are not Gaussian. It is therefore necessary to be able to generate and analyze
non-Gaussian time series. Several non-Gaussian time series models are discussed in this 
paper. The marginal distributions are Laplace or l-Laplace distributions, and the correlation 
structure of the processes mimics that of the standard additive, linear, constant coefficient 
ARMA(p,q) models. 

8 Analysis methodology I: A new class of linear feedback shift register generators 
Pierre L'Ecuyer, Francois Panneton 

December 2000 Proceedings of the 32nd conference on Winter simulation 

Full text available: pdf(77.30 KB) Additional Information: full citation , abstract , references , citings

An efficient implementation of linear feedback shift register sequences with a given
characteristic polynomial is obtained by a new method. It involves a polynomial linear 
congruential generator over the finite field with two elements. We obtain maximal 
equidistribution by constructing a suitable output mapping. Local randomness could be 
improved by combining the generator's output with that of some other (e.g., nonlinear and 
efficient) generator. 

9 Guest introduction 
Pierre L'Ecuyer 

October 2003 ACM Transactions on Modeling and Computer Simulation (TOMACS), 

Volume 13 Issue 4 

Full text available: pdf(46.47 KB) Additional Information: full citation , index terms

10 Input process models
P. A. Jacobs, P. A. W. Lewis, E. McKenzie

December 1983 Proceedings of the 15th conference on Winter Simulation - Volume 2 

Full text available: pdf(124.75 KB) Additional Information: full citation , abstract , references , index terms

Time series models for positive-valued and discrete-valued input processes are discussed,
with the emphasis on the simulation problems which arise in generating time series from
these models. 

11 Analysis methodology: Simulation input analysis: difficulties in simulating queues with
Pareto service 

Donald Gross, John F. Shortle, Martin J. Fischer, Denise M. B. Masi 

December 2002 Proceedings of the 34th conference on Winter simulation: exploring
new frontiers 

Full text available: pdf(204.92 KB) Additional Information: full citation , abstract , references

M/G/1 queues, where G is a heavy-tailed distribution, have applications in Internet 
modeling and modeling for insurance claim risk. The Pareto distribution is a special heavy- 
tailed distribution called a power-tailed distribution, and has been found to serve as 
adequate models for many of these situations. However, to get the waiting time 
distribution, one must resort to numerical methods, e.g., simulation. Many difficulties arise 
in simulating queues with Pareto service and we investigate wh ... 

12 Provably fast integer factoring with quasi-uniform small quadratic residues

B. Vallee 

February 1989 Proceedings of the twenty-first annual ACM symposium on Theory of
computing 

Full text available: pdf(873.23 KB) Additional Information: full citation , abstract , references , index terms 



Finding small quadratic residues modulo n, when n is a large composite number of unknown
factorisation is almost certainly a computationally hard problem. This problem arises in a
natural way when factoring n by the use of congruences of squares. We construct here a
polynomial-time algorithm based on the use of lattices, which finds in a near uniform way
quadratic residues mod n that are smaller than O(n^(2/3)). In this way, we derive a class of
integer factorisati ...

Results (page 1): (random <near/2> sampl*)<and>(database<sentence>partition)

13 A Taxonomy of Global Optimization Methods Based on Response Surfaces

Donald R. Jones 

December 2001 Journal of Global Optimization, Volume 21 Issue 4
Full text available: Publisher Site Additional Information: full citation , abstract

This paper presents a taxonomy of existing approaches for using response surfaces for 
global optimization. Each method is illustrated with a simple numerical example that brings 
out its advantages and disadvantages. The central theme is that methods that seem quite 
reasonable often have non-obvious failure modes. Understanding these failure modes is 
essential for the development of practical algorithms that fulfill the intuitive promise of the 
response surface approach. 



Keywords: global optimization, kriging, response surface, splines 



14 Detection of Cheaters in Vector Space Secret Sharing Schemes

Carles Padro, German Saez, Jorge Luis Villar 

January 1999 Designs, Codes and Cryptography, Volume 16 Issue 1

Full text available: Publisher Site Additional Information: full citation , abstract , references , citings , index terms

A perfect secret sharing scheme is a method of distributing shares of a secret among a set 
P of participants in such a way that only qualified subsets of P can reconstruct the secret 
from their shares and non-qualified subsets have absolutely no information on the value of 
the secret. In a secret sharing scheme, some participants could lie about the value of their 
shares in order to obtain some illicit ... 



Keywords: detection of cheaters, information rate, secret sharing schemes, unconditionaly 
security in secret sharing 



15 Bayesian Sampling and Ensemble Learning in Generative Topographic Mapping
Akio Utsugi 

December 2000 Neural Processing Letters, Volume 12 Issue 3

Full text available: Publisher Site Additional Information: full citation , abstract , references , index terms

Generative topographic mapping (GTM) is a statistical model to extract a hidden smooth
manifold from data, like the self-organizing map (SOM). Although a deterministic search 
algorithm for the hyperparameters regulating the smoothness of the manifold has been 
proposed previously, it is based on approximations that are valid only on abundant data. 
Thus, it often fails to obtain suitable estimates on small data. In this paper, to improve the 
hyperparameter search in GTM, we construct a Gibbs sa ... 

Keywords: Gibbs sampler, Laplace method, Markov chain Monte Carlo, elastic net, 
evidence, hyperparameter search, self-organizing map, variational free energy 



16 Performance evaluation of multiple time scale TCP under self-similar traffic conditions

Kihong Park, Tsunyi Tuan 

April 2000 ACM Transactions on Modeling and Computer Simulation (TOMACS), Volume 
10 Issue 2 

Full text available: pdf(264.71 KB) Additional Information: full citation , abstract , references , index terms
Measurements of network traffic have shown that self-similarity is a ubiquitous 
phenomenon spanning across diverse network environments. In previous work, we have
explored the feasibility of exploiting long-range correlation structure in self-similar traffic for 
congestion control. We have advanced the framework of multiple time scale congestion 
control and shown its effectiveness at enhancing performance for rate-based feedback 
control. In this article, we extend the multiple time scale co ... 

Keywords: TCP, congestion control, multiple time scale, network protocols, performance 
evaluation, self-similar traffic, simulation 



17 Heterogeneous Beliefs, Risk and Learning in a Simple Asset Pricing Model

Carl Chiarella, Xue-Zhong He 

February 2002 Computational Economics, Volume 19 Issue 1

Full text available: Publisher Site Additional Information: full citation , abstract , references , citings , index terms

Trade among individuals occurs either because tastes (risk aversion) differ, endowments 
differ, or beliefs differ. Utilising the concept of 'adaptively rational equilibrium' and a recent 
framework of Brock and Hommes [6, 7] this paper incorporates risk and learning schemes
into a simple discounted present value asset price model with heterogeneous beliefs. 
Agents have different risk aversion coefficients and adapt their beliefs (about future 
returns) over time by choosing from different predi ... 



Keywords: asset pricing, bifurcation, heterogeneous beliefs, risk 



18 Artificial intelligence approaches to software engineering: Using genetic algorithms and
coupling measures to devise optimal integration test orders
Lionel C. Briand, Jie Feng, Yvan Labiche 

July 2002 Proceedings of the 14th international conference on Software engineering 
and knowledge engineering 

Full text available: pdf(94.62 KB) Additional Information: full citation , abstract , references , citings

We present here an improved strategy to devise optimal integration test orders in object-
oriented systems. Our goal is to minimize the complexity of stubbing during integration
testing as this has been shown to be a major source of expenditure. Our strategy to do so
is based on the combined use of inter-class coupling measurement and genetic algorithms.
The former is used to assess the complexity of stubs and the latter is used to minimize
complex cost functions based on coupling measurement. Us ...

Keywords: genetic algorithms, integration order, integration testing, object-oriented
software engineering 



19 LANDMARC: indoor location sensing using active RFID
Lionel M. Ni, Yunhao Liu, Yiu Cho Lau, Abhishek P. Patil
November 2004 Wireless Networks, Volume 10 Issue 6

Full text available: pdf(472.08 KB) Additional Information: full citation , abstract , references , index terms

Growing convergence among mobile computing devices and embedded technology sparks 
the development and deployment of "context-aware" applications, where location is the 
most essential context. In this paper we present LANDMARC, a location sensing prototype 
system that uses Radio Frequency Identification (RFID) technology for locating objects 
inside buildings. The major advantage of LANDMARC is that it improves the overall accuracy 
of locating objects by utilizing the concept of reference tags. ... 

Keywords: RFID, location-aware computing, sensing network, signal strength, wireless 
communication 
20 Algorithm 827: irbleigs: A MATLAB program for computing a few eigenpairs of a large

sparse Hermitian matrix 

J. Baglama, D. Calvetti, L. Reichel 

September 2003 ACM Transactions on Mathematical Software (TOMS), Volume 29 issue 3 
Full text available: pdf(119.50 KB) Additional Information: full citation , abstract , references , index terms

Irbleigs is a MATLAB program for computing a few eigenvalues and associated eigenvectors 
of a sparse Hermitian matrix of large order n. The matrix is accessed only through the 
evaluation of matrix-vector products. Working space of only a few n-vectors is required.
The program implements a restarted block-Lanczos method. Judicious choices of 
acceleration polynomials make it possible to compute approximations of a few of the largest 
eigenvalues, a few of the smallest eigenvalues, or ... 

Keywords: Block Lanczos method, eigenvalue computation, generalized eigenproblem, 
polynomial acceleration, singular values 
... However partition sampling provides more advantages with similar ... Figure (2) presents
results on a real database. ... We use Latent Semantic Analysis, as described ... 
This dissertation is composed of three parts. Each of them describes a
new computational method specifically developed for assisting the rational 
drug design and discovery, either ligand-based or receptor-based. 

Recursive partitioning is a powerful data mining technique and has 
been successfully applied to large chemical data sets like HTS data sets, 
however, the previous work was limited to 2D descriptors, while medicinal 
chemists believe that drug molecules exert their pharmaceutical functions 
in the three dimensions. So, reported here is my work extending the former 
recursive partitioning analysis 1 8 to the three dimensions, using 3D 
"atom" pairs as molecular descriptors. Correct 3D
structure-activity relationships were successfully derived from a data set
containing 1,644 monoamine oxidase inhibitors.

Based on the successful 3D recursive partitioning work, a novel
computational program, SCAMPI (Statistical Classification of Activities
of Molecules for Pharmacophore Identification), is developed for
identifying pharmacophores from large chemical data sets. SCAMPI combines
recursive partitioning and fast conformational search methods and makes them
dependent on each other in the pharmacophore identification process by
following the adaptive sampling strategy. The recursive partitioning
algorithm implemented in SCAMPI belongs to the class of CHAID algorithms.
The conformational search algorithm in SCAMPI is based on the
“differential distance equation” algorithm. Presently, SCAMPI
is able to derive pharmacophores from 1,000–2,000 compounds within
one day of computation on an SGI R10000 machine.

For receptor-based drug design, a generalized linear response method 
is developed for facilitating the hydration and binding free energy 
calculations. This new method is based on the standard linear response 
approximation and extends it to the van der Waals contribution term. 
Compared with other linear response methods for free energy calculations, 
this method does not contain any new empirical parameters. This method has
been tested for hydration and binding free energy calculations and
shown to give calculated results consistent with the
experimental data in both cases.

This research study presents the mathematical basis for building the
MC-HARP data-processing environment. The MC-HARP strategy determines the
functional structure and parameters of a mathematical model simultaneously.
A Monte Carlo (MC) strategy combined with the concept of Hierarchical
Adaptive Random Partitioning (HARP) and fuzzy subdomains determines the
multivariate parallel distributed mappings. The constructed mapping can be
modeled as a neural network. The HARP algorithm is based on a
divide-and-conquer strategy that partitions the input space into
measurable connected subdomains and builds a local approximation for the
mapping task. Fuzziness promotes continuity of the mapping constructed by
HARP and smooths the mismatching of the local approximations in the
neighboring subdomains. The Monte Carlo superposition of a sample of
random partitions reduces the localized disturbances among the fuzzy
subdomains, controls the global smoothness of the mean average mapping, and
improves the generalization of the constructed mapping.
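
The Monte Carlo superposition of random partitions can be illustrated with a small sketch. This is not the MC-HARP implementation: it assumes a one-dimensional input space on [0, 1], uses flat (non-hierarchical) random partitions, replaces each local approximation by the mean of the samples in a subdomain, and omits the fuzzy blending between neighbors. The names `fit_random_partition` and `mc_predict` are hypothetical.

```python
import bisect
import random

def fit_random_partition(xs, ys, n_cuts, rng):
    """Fit one piecewise-constant model on a random partition of [0, 1]."""
    cuts = sorted(rng.random() for _ in range(n_cuts))
    sums = [0.0] * (n_cuts + 1)
    counts = [0] * (n_cuts + 1)
    for x, y in zip(xs, ys):
        cell = bisect.bisect_right(cuts, x)   # index of the subdomain containing x
        sums[cell] += y
        counts[cell] += 1
    overall = sum(ys) / len(ys)               # fallback for empty subdomains
    means = [s / c if c else overall for s, c in zip(sums, counts)]
    return cuts, means

def mc_predict(models, x):
    """Average the local approximations of all random partitions at x."""
    total = sum(means[bisect.bisect_right(cuts, x)] for cuts, means in models)
    return total / len(models)

rng = random.Random(42)
xs = [i / 199 for i in range(200)]
ys = [x * x for x in xs]                      # toy target mapping (assumption)
models = [fit_random_partition(xs, ys, n_cuts=8, rng=rng) for _ in range(200)]
estimate = mc_predict(models, 0.5)            # smoothed estimate near 0.5 ** 2
```

Each single model is discontinuous at its cut points; averaging over many independent partitions moves the discontinuities around and so smooths the mean mapping, which is the effect the abstract attributes to the Monte Carlo superposition.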

The tree structure of the HARP modules and the independence of both 
the subdomain approximations and the random partitions enable the MC-HARP 
environment to quickly converge to a series of equally plausible solutions 
without user interaction. The MC-HARP environment enjoys a large-scale 
granularity produced by the Monte Carlo parallelism and the geometric 
parallelism achieved by partitioning the input space. Therefore, this
environment can exhibit good performance on parallel computers for large
and complex scientific databases.

The developed MC-HARP philosophy for building data-based
approximate mappings leads to a novel model selection criterion and an
original framework for classifying data-fitting problems. The MC-HARP
environment not only can build approximate multivariate mappings with
self-organization capability, noise and fault tolerance, adaptivity,
generalization, highly plastic and stable learning characteristics with
respect to the addition of new data points, and parallel structure but also
can answer fundamental questions in data-based mathematical modeling.
These questions include: (1) What is the confidence level for each
predicted output of the constructed model? (2) What is the approximation
confidence measure for the constructed model? (3) How does the functional
complexity of the actual multivariate mapping change over the input space?
(4) What is the suitable structural complexity for a data-based model
using noisy data? (5) What is the level of noise in the data? (6) Is the
amount of training data adequate? If not, which regions of the input space
need more data? (7) Is the selected parametric model suitable? (8) What is
the conditioning of a data-fitting problem? (9) Is data-based
mathematical modeling promising for the given task?

The developed MC-HARP environment can support the diverse needs of the
scientific and engineering community. It has the versatility to develop and
verify parametric and nonparametric mathematical models and also global and
local approximate mappings. Furthermore, it establishes an environment for
unifying existing mathematical modeling techniques in statistics,
approximation theory, information theory, system identification, and neural
networks.

Statistical pattern recognition techniques may be applied to
cardiograms for automated diagnosis. The three vectorcardiographic signals
in the Frank-orthogonal-lead system are expressed as a truncated
Karhunen-Loeve expansion in terms of a set of time-varying orthogonal basis
vectors. These vectors are derived from the second-order statistics of 
the data. In addition to the basic formulation of the algorithm, an elegant 
proof of its minimizing property is presented. 

An ensemble of 670 cardiograms is being used to train the 
algorithm, and the resulting pattern vectors are clustered in a 
multidimensional features space. Baseline restoration is first performed on 
the data using a true third-order spline technique for best Y(t)-estimate
of the baseline. Ordinates, Y, in the estimates are directly deducted from 
the P-Q interval of the waveform. The cardiogram is an ensemble of 
quasi-stationary processes; this is due to variations in both R-R and P-R 
intervals. As a feature extractor, the K-L expansion is optimal compared 
with Fourier-of performed on non-stationary processes. To achieve 
optimality, the heart is segmented (time partitioned) into two processes, 
namely the P-wave and QRST segment, and each segment is aligned on its 
fiducial point. The R-wave fiducial point is detected by searching the 
magnitude of the vector velocity for maxima. The P-wave fiducials are 
located via a new multi-template correlation algorithm.

Two separate K-L expansions are performed on each process. An 
ensemble-global K-L expansion is performed on the P-processes, to compute 
the P-basis vectors. Further, the ensemble is partitioned
(ensemble-partitioning) into three partitions: (1) Gross-abnormal: This is
the partition of gross depolarization abnormals in the QRS complex (LBBB,
RBBB, etc.). (2) All-But-Gross: This is the partition of all other
abnormalities. (3) QRS Suppressed: This is the partition of certain
repolarization abnormals and normals (ST, T, etc.). A partition-global K-L
expansion is then performed on the QRST process in each partition, yielding
a set of K-L vectors for each. The underlying concept here is that, since the
class distribution probabilities are not known a priori, a much more
efficient feature extractor would result if the ensemble is partitioned.

In addition, since the K-L expansion is optimal using the least
mean square error criterion, and since the PQRST is heavily weighted by the
QRS complex (energy-wise), and to improve on classification accuracies in
the post-QRS segment, the waveform is time weighted (or QRS-Suppressed)
before computing the partition base functions. This is accomplished by 
multiplying each time-varying sample of the partition by a weighting 
function. By suppressing the QRS, the basis vectors best represent 
repolarization classes. 

The approach to the pattern recognition problem is hierarchical: 
(1) Find a first-cut classification of pattern vectors using
ensemble-global K-L expansion. (2) Pursue a much more accurate 
diagnosis/classification using partition-global K-L expansion. 

To completely formulate the classification problem, the structure 
of the feature space is studied, using a fuzzy clustering algorithm with 
supervised seeding and class-dependent fuzziness. The underlying concept 
here is, since classes in the feature space are overlapping to various 
degrees, parametrization is best estimated using the fuzzy approach. This 
is an extremely innovative concept (compared with hard-clustering) in 
handling the following two problems: (1) It allows for slight
misclassification errors on the part of the cardiologist (is the diagnosis 100%
certain abnormality or is it 95% and 5% others?) (2) It gives a quantitative
measure of the probability of each disease state in multiple-diagnosis
vectorcardiograms. Probability measures are proportional to some membership 
function measures. 

The clusterer described above is applied to a subset of the
database that includes relatively nonempty sets of pure classes and one
multiple diagnosis class. Members of the multiple diagnosis class are found 
to be best characterized as being cases with multiple membership functions 
to the adjacent pure classes, rather than being a class of their own. 

A method of sorting large textual databases by computer using
external storage is proposed. The range of sort-keys in a sample of data
to be sorted is divided into a fixed set of partitions, which should also
give an adequate representation of new data from a similar source. The
partitions are composed of ordered key ranges. An incoming data stream
is distributed into a series of bins according to the partition in which
the key lies, and the bins are then separately sorted, using an internal
sort, to give an ordered file. It is shown how the number of disc accesses
needed depends on the manner in which the bins become filled, and thus on
statistics of the data. Experiments using an INSPEC database give
information on which estimates of the efficiency of the method can be based.
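
The sample-then-distribute scheme described above can be sketched in a few lines. This is an in-memory illustration only, under the assumption that partition boundaries are taken at evenly spaced ranks of the sorted sample; the abstract does not say how the key ranges are chosen, and the external-storage bins are stood in for by Python lists. The function names are hypothetical.

```python
import bisect
import random

def choose_boundaries(sample, n_bins):
    """Pick n_bins - 1 split keys at evenly spaced ranks of the sorted sample."""
    ordered = sorted(sample)
    return [ordered[(i * len(ordered)) // n_bins] for i in range(1, n_bins)]

def partition_sort(stream, boundaries):
    """Distribute keys into bins by key range, sort each bin, then concatenate."""
    bins = [[] for _ in range(len(boundaries) + 1)]
    for key in stream:
        # bisect_right gives the index of the ordered key range containing `key`.
        bins[bisect.bisect_right(boundaries, key)].append(key)
    result = []
    for b in bins:
        b.sort()          # stands in for the internal sort of one bin
        result.extend(b)  # bins are already in key-range order
    return result, [len(b) for b in bins]

rng = random.Random(0)
sample = [rng.randrange(10_000) for _ in range(200)]    # sample used to fix the partitions
stream = [rng.randrange(10_000) for _ in range(5_000)]  # new data from a similar source
boundaries = choose_boundaries(sample, n_bins=8)
ordered, bin_sizes = partition_sort(stream, boundaries)
```

The `bin_sizes` list shows how evenly the bins fill, which is the quantity the abstract ties to the number of disc accesses.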

Author(s): Cardoso, M.G.M.S.; Themido, I.H.; Pires, P.M.

Author Affiliation: Inst. Superior Tecnico, Tech. Univ. Lisbon, Portugal 
Journal: Intelligent Data Analysis vol.3, no. 6 p. 491-510 
Publisher: Elsevier, 

Publication Date: Dec. 1999 Country of Publication: Netherlands 
Abstract: Discusses the evaluation of a clustering solution. Criteria
based on the number of clusters and discrimination and classification 
processes are used to evaluate the clustering solution. The proposed 
approach is based on two paradigms: statistics and machine learning. A 
multi-methodological approach is advocated in the construction of models 
associating properties with clusters, to provide a wider and richer set of 
analysis perspectives and better knowledge discovery. Specifically, the 
construction of classification and discrimination logical models as a 
complement of quantitative statistical models is particularly useful when 
most of the available information is of a qualitative nature. Both the 
classification's global precision and the comprehension added by the 
discriminant model to the association between variables and clusters are 
essential to evaluate a clustering solution. Depending on the dimension of
the sample, the descriptive analysis performed can be validated by
partitioning the total sample into two or by other procedures of 
cross-validation. The proposed evaluation approach is applied to a 
marketing/tourism case study. The clustering solution is built upon a 
sample of more than 2,500 Portuguese clients of Pousadas Portugal Hotels. 
The database includes variables related to the evaluation of client stays 
at the Pousadas and profiles of the surveyed clients on holidays, 
demographic and psychographic aspects. Measures of association, chi-squared
tests, ANOVA, discriminant analysis, logistic regression and rule induction 
are applied in evaluating the clustering solution built through a K-means 
process. (14 Refs)
Subfile: C

Author Affiliation: Dept. of Civil Eng., Illinois Univ., Urbana, IL, USA
Journal: Mathematical and Computer Modelling vol.22, no. 8 p. 73-90 
Publication Date: Oct. 1995 Country of Publication: UK 
CODEN: MCMOEG ISSN: 0895-7177 

U.S. Copyright Clearance Center Code: 0895-7177/95/$9.50+0.00
Abstract: Establishes the mathematical basis for building the MC-HARP
data-processing environment. The MC-HARP strategy determines the functional
structure and parameters of a mathematical model simultaneously. A Monte
Carlo (MC) strategy combined with the concept of Hierarchical Adaptive
Random Partitioning (HARP) and fuzzy subdomains determines the
multivariate parallel distributed mapping. The HARP algorithm is based on a
divide-and-conquer strategy that partitions the input space into 
measurable connected subdomains and builds a local approximation for the 
mapping task. Fuzziness promotes continuity of the mapping constructed by 
HARP and smooths the mismatching of the local approximations in the 
neighboring subdomains. The Monte Carlo superposition of a sample of 
random partitions reduces the localized disturbances among the fuzzy 
subdomains, controls the global smoothness of the mean average mapping, and 
improves the generalization of the approximation. The authors illustrate 
the procedure by applying it to a two-dimensional surface fitting problem.
(23 Refs)
local approximation; mapping task; fuzziness; global smoothness; mean
average mapping; two-dimensional surface fitting problem 

Statistical analysis of large data sets often requires an initial data
editing and preparation phase to check the validity of individual data
items, check for consistency among related data, correct erroneous data,
and supply (impute) values for missing data where possible. During this
preparatory phase of analysis, it is often necessary to partition the
data set into a number of subsets by logical selection and/or
random-sampling techniques for purposes of hypothesis testing. This paper
examines the data-management support required by these editing and
subsetting operations in terms of lower-level data-manipulation functions
and mappings between logical and physical data structures. Advantages of
transposed data files for statistical applications are discussed in
comparison with record-based structures. A specific self-describing
transposed-file design is described in detail, with emphasis on
representations of logical data structures commonly encountered in
statistical databases. (ERA citation 08:043115)

354000098673190040
Document Type: P (Serial); A (Analytic)
Language: English

The goal of record linkage is to link quickly and accurately records that 
correspond to the same person or entity. Whereas certain patterns of 
agreements and disagreements on variables are more likely among records 
pertaining to a single person than among records for different people, the 
observed patterns for pairs of records can be viewed as arising from a 
mixture of matches and nonmatches. Mixture model estimates can be used to 
partition record pairs into two or more groups that can be labeled as 
probable matches (links) and probable nonmatches (nonlinks) . A method is 
proposed and illustrated that uses marginal information in the database 
to select mixture models, identifies sets of records for clerks to review 
based on the models and marginal information, incorporates clerically 
reviewed data, as they become available, into estimates of model 
parameters, and classifies pairs as links, nonlinks, or in need of further 
clerical review. The procedure is illustrated with five datasets from the 
U.S. Bureau of the Census. It appears to be robust to variations in 
record-linkage sites. The clerical review corrects classifications of some 
pairs directly and leads to changes in classification of others through 
reestimation of mixture models. 

Corporate Source: Nanyang Technological Univ, Singapore, Singapore
Conference Title: Proceedings of the 1997 1st International Conference on
Information, Communications and Signal Processing, ICICS. Part 2 (of 3)
Conference Location: Singapore, Singapore Conference Date:
19970909-19970912
Sponsor: IEEE 

E.I. Conference No.: 48010 

Source: Trends in Information Systems Engineering and Wireless Multimedia 
Communications Proceedings of the International Conference on Information, 
Communications and Signal Processing, ICICS v 2 1997. IEEE, Piscataway, NJ, 
USA. p 800-804 

Publication Year: 1997 

CODEN: 002795 

Language: English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical) 
Journal Announcement: 9804W4 

Abstract: Vertical Partitioning of Object Oriented Databases (OODBs) is
a difficult problem. In the present paper, we present a simulated annealing
(SA) approach for generating partitions which are suitable for asynchronous
parallel processing of queries. We study two cost functions for SA and
compare the resulting partitions with respect to irrelevant IO, %
distribution of IO load for transactions across the processing nodes and
the standard deviation of the partition sizes which determines the
load balance in the asynchronous parallel query processing. The results are
compared with one of the existing vertical partitioning algorithms. (Author
abstract) 9 Refs.

Conference Title: Proceedings of the Eleventh International Conference on
Information and Knowledge Management. CIKM 2002 p. 60-7
Publication Date: 2002 Country of Publication: USA xiv+690 pp.
Abstract: In decision support systems, having knowledge on the top k
values is more informative and crucial than the maximum value.
Unfortunately, the naive method involves high computational cost and the 
existing methods for range-max querying are inefficient if applied 
directly. We propose a pre-computed partition top method (PPT) to partition 
the data cube and pre-store a number of top values for improving query 
performance. The main focus of this study is to find the optimum values for 
two parameters, i.e., the partition factor (b) and the number of pre-stored 
values (r), through an analytical approach. A cost function based on
Poisson distribution is used for the analysis. The analytical results 
obtained are verified against simulation results. It is shown that the PPT 
method outperforms other alternative methods significantly when proper b 
and r values are used. (14 Refs) 
Subfile: C 

Descriptors: data mining; database management systems; decision support 
systems; performance evaluation; query processing 

Identifiers: precomputed partition top method analysis; range top k 
queries; OLAP data cubes; decision support systems; range-max query; data 
cube partitioning; Poisson distribution based cost function; PPT method; 
online analytical processing; search space reduction; query algorithm; 
physical storage method; decision making environment; LPC 

Class Codes: C6160 (Database management systems (DBMS)); C7102
(Decision support systems); C6130 (Data handling techniques)

Copyright 2003, IEE
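
The idea of pre-storing top values per partition, as in the abstract above, can be sketched for a one-dimensional array. This is an illustrative simplification, not the published PPT method: the paper works on data cubes and derives the optimum `b` and `r` analytically, whereas here `b` and `r` are simply given and the fallback for `k > r` is a plain scan. The function names are hypothetical.

```python
import heapq

def build_ppt(values, b, r):
    """Pre-store the r largest values of each block of b consecutive cells."""
    return [heapq.nlargest(r, values[i:i + b]) for i in range(0, len(values), b)]

def range_top_k(values, tops, b, r, lo, hi, k):
    """Return the k largest values in values[lo:hi] (hi exclusive), largest first."""
    first_full = -(-lo // b)      # first block lying entirely inside the range
    last_full = hi // b           # one past the last such block
    if first_full >= last_full:   # range covers no whole block: scan it directly
        return heapq.nlargest(k, values[lo:hi])
    candidates = list(values[lo:first_full * b])   # partial block on the left
    candidates.extend(values[last_full * b:hi])    # partial block on the right
    for blk in range(first_full, last_full):
        if k <= r:
            candidates.extend(tops[blk])           # pre-stored values suffice
        else:
            candidates.extend(values[blk * b:(blk + 1) * b])  # fall back to a scan
    return heapq.nlargest(k, candidates)

cells = [5, 17, 3, 9, 22, 8, 1, 14, 30, 2, 11, 6]
tops = build_ppt(cells, b=4, r=2)
answer = range_top_k(cells, tops, b=4, r=2, lo=1, hi=11, k=2)
```

With `k <= r`, a fully covered block contributes only its `r` pre-stored values instead of `b` cells, which is where the query saving comes from; larger `r` costs storage, larger `b` leaves longer partial blocks to scan.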



27/5/8 (Item 3 from file: 2) 

DIALOG (R) File 2:INSPEC 

(c) 2003 Institution of Electrical Engineers. All rts. reserv. 

7012612 INSPEC Abstract Number: C2001-09-6160Z-031 
Title: Improving temporal joins using histograms 
Author(s): Sitzmann, I.; Stuckey, P.J.

Author Affiliation: Dept. of Comput. Sci. & Software Eng., Melbourne 
Univ., Parkville, Vic, Australia 

Conference Title: Database and expert systems applications. 11th 
International Conference, DEXA 2000, Proceedings (Lecture Notes in Computer 
Science Vol.1873) p. 488-98 

Editor(s): Ibrahim, M.; Kung, J.; Revell, N. 

Publisher: Springer-Verlag, Berlin, Germany 

Publication Date: 2000 Country of Publication: Germany xix+1003 pp.
Conference Title: Database and Expert Systems Applications. 11th
International Conference, DEXA 2000. Proceedings
Abstract: Histograms are used in most commercial database systems to
estimate query result sizes and evaluation plan costs. They can also be
used to optimize join algorithms. The authors consider how to use
histograms to improve the join processing in temporal databases. We
define histograms for temporal data and a temporal join algorithm that
makes use of this histogram information. The join algorithm is a temporal
partition-join with dynamic buffer allocation. Histogram information is
used to determine partition boundaries that maximize overall buffer
usage. We compare the performance of this join algorithm to temporal join
evaluation strategies that do not use histograms, such as a partition-based
algorithm based on sampling and a partition-join using the Time Index, an
index structure for temporal data. The results demonstrate that the
temporal partition-join is substantially improved through the incorporation
of histogram information, showing significantly better performance than the
sampling based algorithm and achieving equivalent performance to the Time
Index join without requiring an index. (12 Refs)
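
How histogram counts can drive partition boundaries may be sketched as follows. This is a simplified assumption-laden illustration, not the authors' algorithm: it treats the histogram as tuple counts per equal-width time bucket, ignores tuples whose validity interval spans several partitions, and greedily packs consecutive buckets so that each partition fills the buffer as far as possible without exceeding it. `partition_boundaries` is a hypothetical name.

```python
def partition_boundaries(bucket_counts, bucket_width, buffer_capacity):
    """Greedily pack consecutive histogram buckets into partitions that fit the buffer."""
    boundaries = []
    load = 0
    for i, count in enumerate(bucket_counts):
        if load and load + count > buffer_capacity:
            boundaries.append(i * bucket_width)  # close the partition before bucket i
            load = 0
        load += count
    return boundaries

# Tuple counts per 10-unit time bucket; the buffer holds 80 tuples (toy numbers).
counts = [40, 10, 30, 50, 5, 5, 60]
cuts = partition_boundaries(counts, bucket_width=10, buffer_capacity=80)
# cuts == [30, 60]: partitions cover [0, 30), [30, 60) and [60, 70)
```

A bucket that alone exceeds the capacity still gets its own partition in this sketch; a real implementation would need finer buckets or an overflow strategy there.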

Treatment: Theoretical (T)

Abstract: This paper presents a general methodology for the efficient 
parallelization of existing data cube construction algorithms. We describe 
two different partitioning strategies, one for top-down and one for 
bottom-up cube algorithms. Both partitioning strategies assign subcubes to 
individual processors in such a way that the loads assigned to the 
processors are balanced. Our methods reduce inter-processor communication 
overhead by partitioning the load in advance; they enable code reuse by 
permitting the use of existing sequential data cube algorithms for the 
subcube computations on each processor. This supports the transfer of 
optimized sequential data cube code to a parallel setting. The bottom-up
partitioning strategy balances the number of single attribute external
memory sorts made by each processor. The top-down strategy partitions a
weighted tree in which weights reflect algorithm specific cost measures
like estimated group-by sizes. Both partitioning approaches can be
implemented on any shared disk type parallel machine. Experimental results 
presented show that our partitioning strategies generate a close to optimal 
load balance between processors. (27 Refs) 
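
Balancing weighted work units across processors can be illustrated with a generic greedy heuristic. This is a stand-in, not either of the partitioning strategies in the abstract: it ignores the tree structure entirely and simply assigns the heaviest remaining task to the currently least-loaded processor. The task names and weights (for example, estimated group-by sizes) are made up for illustration.

```python
import heapq

def assign_tasks(weights, n_procs):
    """Assign each task, heaviest first, to the currently least-loaded processor."""
    heap = [(0, p) for p in range(n_procs)]          # (current load, processor id)
    assignment = {p: [] for p in range(n_procs)}
    for task, w in sorted(weights.items(), key=lambda kv: -kv[1]):
        load, p = heapq.heappop(heap)
        assignment[p].append(task)
        heapq.heappush(heap, (load + w, p))
    loads = {p: sum(weights[t] for t in tasks) for p, tasks in assignment.items()}
    return assignment, loads

# Hypothetical subcube costs.
subcube_cost = {"A": 8, "B": 7, "C": 6, "D": 5, "E": 4}
assignment, loads = assign_tasks(subcube_cost, n_procs=2)
```

Because the assignment is fixed before any computation starts, each processor can then run an unmodified sequential routine on its own share, which mirrors the code-reuse argument in the abstract.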

Abstract: Presents a functional testing tool for the BADA-III DBMS,
which is an object-oriented database system under development in the
Electronics and Telecommunications Research Institute. The system
architecture and characteristics of the testing tool, test databases,
design principles for test cases and implementation issues are described in
detail. The schema of the test databases is constructed to be suitable
for the object-oriented paradigm, and the instances are synthesized to help
the user to understand easily. The test tool features test independence and
self-evaluation, and has been developed to verify all the functionalities
of BADA-III/C++. Each test case has been derived under eight design
principles that are essentially based on various black-box techniques, such
as equivalence partitioning, boundary-value analysis and error
guessing. The testing tool offers 966 test cases in total, in 167 test
programs. (12 Refs)
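
Two of the black-box techniques named above can be shown on a toy example. The sketch assumes an input that is valid on an inclusive integer range; the validator `accepts_length` and its limits are hypothetical and have nothing to do with the BADA-III test suite.

```python
def equivalence_classes(lo, hi):
    """One representative input per class: below, inside and above [lo, hi]."""
    return {"below": lo - 1, "valid": (lo + hi) // 2, "above": hi + 1}

def boundary_values(lo, hi):
    """Inputs on and immediately around both edges of the inclusive range [lo, hi]."""
    return [lo - 1, lo, lo + 1, hi - 1, hi, hi + 1]

def accepts_length(n, lo=1, hi=10):
    """Hypothetical function under test: accepts lengths from lo to hi inclusive."""
    return lo <= n <= hi

classes = equivalence_classes(1, 10)
class_results = {name: accepts_length(v) for name, v in classes.items()}
edge_results = [accepts_length(v) for v in boundary_values(1, 10)]
```

Equivalence partitioning keeps the number of cases small by testing one value per class, while boundary-value analysis adds the inputs where off-by-one errors tend to hide.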

An aspect graph plays an important role in three-dimensional object
recognition. It represents the three-dimensional shape of an object by its
two-dimensional qualitative views as seen from various viewpoints. To
create the aspect graph of an object, the viewpoint space is partitioned 
into regions, each of which corresponds to qualitatively similar 
projections of the object. Algorithms for creating aspect graphs of 
polyhedral objects have been developed. We developed an algorithm to 
compute the aspect graph of a curved object. Our approach partitions the 
viewpoint space by computing boundary viewpoints from the shape 
descriptions of the object given in a computer-aided design database.
These computations are formulated from the understanding of visual events 
and the locations of corresponding viewpoints. We also studied new visual 
events for piecewise smooth objects. 

ABSTRACT: Due to the large size of modern databases, it might be useful
to divide an entire database into smaller partitions. Three basic forms
of partitioning exist: round robin, hash-based, and range. For specific
purposes, hybrids of all three can also be made. Round robin, the easiest
way to partition, guarantees almost equal partition size. With
range partitioning, a certain defined range of a record value goes
into the various data stores. Hash-based partitioning is a more abstract
form of range partitioning.
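
The three basic forms named above can be compared side by side in a short sketch. It is a generic illustration rather than any particular DBMS's behavior; the function names, the use of CRC-32 as the hash, and the split points are assumptions.

```python
import bisect
import zlib

def round_robin(records, n):
    """Deal records out in turn; partition sizes differ by at most one."""
    parts = [[] for _ in range(n)]
    for i, rec in enumerate(records):
        parts[i % n].append(rec)
    return parts

def hash_partition(records, n, key):
    """Place each record by a hash of its key value."""
    parts = [[] for _ in range(n)]
    for rec in records:
        # crc32 is stable across runs, unlike the built-in hash() for strings.
        parts[zlib.crc32(str(key(rec)).encode()) % n].append(rec)
    return parts

def range_partition(records, split_points, key):
    """Place each record by the key range it falls in (split_points sorted ascending)."""
    parts = [[] for _ in range(len(split_points) + 1)]
    for rec in records:
        parts[bisect.bisect_right(split_points, key(rec))].append(rec)
    return parts

rows = list(range(10))
rr = round_robin(rows, 3)
hp = hash_partition(rows, 3, key=lambda r: r)
rp = range_partition(rows, [3, 7], key=lambda r: r)
```

Round robin balances sizes but cannot locate a record by value; range partitioning keeps neighboring key values together, which helps range queries; hash partitioning locates a record by key without needing hand-picked split points.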



File 275:Gale Group Computer DB(TM) 1983-2003/Aug 08
         (c) 2003 The Gale Group
File 621:Gale Group New Prod.Annou.(R) 1985-2003/Aug 08
         (c) 2003 The Gale Group
File 636:Gale Group Newsletter DB(TM) 1987-2003/Aug 08
         (c) 2003 The Gale Group
File  16:Gale Group PROMT(R) 1990-2003/Aug 08
         (c) 2003 The Gale Group
File 160:Gale Group PROMT(R) 1972-1989
         (c) 1999 The Gale Group
File 148:Gale Group Trade & Industry DB 1976-2003/Aug 08
         (c) 2003 The Gale Group
File 624:McGraw-Hill Publications 1985-2003/Aug 07
         (c) 2003 McGraw-Hill Co. Inc
File  15:ABI/Inform(R) 1971-2003/Aug 07
         (c) 2003 ProQuest Info&Learning
File 647:CMP Computer Fulltext 1988-2003/Jul W2
         (c) 2003 CMP Media, LLC
File 674:Computer News Fulltext 1989-2003/Aug W1
         (c) 2003 IDG Communications
File 696:DIALOG Telecom. Newsletters 1995-2003/Aug 07
         (c) 2003 The Dialog Corp.
File 369:New Scientist 1994-2003/Aug W1
         (c) 2003 Reed Business Information Ltd.



Set     Items    Description
S1      1806855  DATABASE? ? OR DATA()BASE? ? OR REPOSITOR??? OR DBM OR DBMS
                 OR RDBM OR RDBMS
S2      20419    S1(5N)(DUPLICAT? OR REPLICA? OR COPY??? OR COPIE? ? OR
                 REPRODUC?)
S3      3918     PARTITION?(5N)(DETERMIN? OR ESTIMAT??? OR ANALYZ? OR ANALYS?
                 OR ASSESS? OR CALCULAT? OR ASCERTAIN? OR COMPUTE OR COMPUTES
                 OR COMPUTED OR COMPUTING OR GAUG? OR EVALUAT? OR FIGURED OR
                 FIGURING OR MEASUR? OR DEFIN?)
S4      2366     PARTITION?(5N)(SIZE? ? OR SIZING OR BOUNDAR??? OR RANGE? ? OR
                 EXTENT? ? OR MAGNITUDE? ?)
S5      1095652  SAMPL???
S6      1673897  STATISTIC??
S7      0        S2(S)S3:S4(S)S5(S)S6
S8      5        S1(S)S3:S4(S)S5(S)S6
S9      32       S1(S)S3:S4(S)S5:S6
S10     32       S8:S9
S11     28       RD (unique items)
S12     24       S10 NOT PD>20010702
S13     194      S4(5N)(DETERMIN? OR ESTIMAT??? OR ANALYZ? OR ANALYS? OR
                 ASSESS? OR CALCULAT? OR ASCERTAIN? OR COMPUTE OR COMPUTES OR
                 COMPUTED OR COMPUTING OR GAUG? OR EVALUAT? OR FIGURED OR
                 FIGURING OR MEASUR? OR DEFIN?)
S14     24       S13(S)S5:S6
S15     18       RD (unique items)
S16     27       PARTITION? ?(10N)S5(10N)S6
S17     23       RD (unique items)



12/3,K/1 (Item 1 from file: 275)

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2003 The Gale Group. All rts. reserv. 

02483843 SUPPLIER NUMBER: 70909008 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Look Before You Leap. (Technology Information) 
SARADHI, VIJAY; SIMONEAU, MARTIN 
Intelligent Enterprise, 4, 3, 40 
Feb, 2001 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2784 LINE COUNT: 00223 

...ABSTRACT: re-engineering. Managers must perform extensive evaluations 
of software, hardware and the overall data model before designing the ETL 
process. The next steps are to size the database and determine 
partitioning strategy. Building the actual prototype involves first 
identifying its focus and then creating database objects using the models 
and tools selected. A prototype should be populated with a good-size 
sample of real data if possible. Generating reports and running ad-hoc 
queries helps test a warehouse. The final step is to digest the results
from...

PCI 2.1 compliant. The GFX-500F costs 3119.
SOFTWARE 

Candle's IntelliWatch Pinnacle 99 for Lotus Notes features new 
replication monitoring and assurance capabilities. Statistics measure 
replication performance at the individual database level. It also offers 
automatic detection, correction, and problem notification. IntelliWatch 
Pinnacle 99 costs $4,800 per single partitioned server. ... OnMark 2000 
Assess 4.0 from Viasoft scans PCs for Y2K issues in hardware/BIOS, 
applications, and data files. It automatically expands two-digit years in 
Excel spreadsheets and lets organizations scan compressed and archived 
databases for Y2K compliance. OnMark 2000 Assess 4.0 costs $49.

... of Red Brick Warehouse consists of three components: a database
server, a load subsystem, and gateway technologies for client/server 
access. 

Red Brick's relational database server was designed to support 
databases typically larger than 500GB with billions of records. It uses 
compact representations for numeric data and compressed ...it employs 
parallel scanning, parallel joining, and trademarked technologies it calls
parallel-on-demand and parallel SuperScan. Using parallel-on-demand, the
Red Brick query analyzer partitions queries for the optimal degree of
parallelism, where it considers the query's complexity, the tables'
partitioning, and the available resources. For example, it allocates...
...execution methods for the subsequent steps. In this way, it eliminates
the performance problems sometimes introduced when a query optimizer uses
out-of-date statistics.

Red Brick supports conventional B-tree, star, and target indexes for 
different types of queries. Star indexes are automatically built when 
tables are created -- they...

12/3,K/4 (Item 4 from file: 275)

DIALOG (R) File 275: Gale Group Computer DB(TM) 
(c) 2003 The Gale Group. All rts. reserv. 

01848464 SUPPLIER NUMBER: 17587513 (USE FORMAT 7 OR 9 FOR FULL TEXT)

Joe Celko's SQL for Smarties: Advanced SQL Programming. (book reviews)

Frank, Maurice

DBMS, v8, n11, p36(1)

Oct, 1995 

DOCUMENT TYPE: Review ISSN: 1041-5173 LANGUAGE: English 

RECORD TYPE: Full text 

WORD COUNT: 372 LINE COUNT: 00032 

... trees (hierarchical relationships between data values). Celko shows
how to solve problems that many people claim SQL is incapable of handling,
such as calculating simple statistics (median, mode, variance, and 
standard deviation), running totals, rankings, and subsets (that is, top 
10), and cross tabulations. Most examples are based on ANSI/ISO SQL-92, but 
the author also discusses proprietary features in popular SQL DBMS 
products. 

This is definitely a tips and tricks book, but Celko never neglects to 
explain why some approaches work better than others. He frequently shows... 

... the data requirements are," Gold-Bernstein said. "Sometimes you
cannot predict how people will randomly ask questions" of a database, for
example.

Sarma advocates a statistical approach. Compiling figures on 
database calls and physical I/Os gives managers a basis for assessing 
network load, he said. 

Application partitioning can also be a powerful tool for managing 
network load and making the best use of CPUs on both client and server 
machines. But Sarma... 
Center, San Jose, CA 95120.
Further Results on the Security of Partitioned Dynamic 
Statistical Databases Mary McLeish 

Partitioning is a highly secure approach to protecting statistical 
databases. When updates are introduced, security depends on putting
restrictions on the sizes of partition sets which may be queried. To 
overcome this problem, attempts have been made to add "dummy" records. 
Recent work has shown that this leads to. . . 
DOCUMENT TYPE: technical ISSN: 0362-5915 LANGUAGE: ENGLISH

RECORD TYPE: ABSTRACT 

ABSTRACT: Partitioning as a means of protecting statistical data bases 
is a highly secure approach. Maintaining security during updates requires 
restricting the sizes of partition sets which may be queried. Adding 
dummy records to overcome this problem has been shown to cause high 
information loss. A model is presented which... 

...and alternatives to adding dummy records presented. The security problem 
is examined, with if and only if conditions considered. Security is found 
to hold if partition sizes are kept even. The practical implications of 
this model for the database manager are considered. 
TEXT:

...pointers to custom written logic routines; descriptions of object 
behaviors and relationships between objects; input device bindings to 
actions; collision callback mapping; playback constraints; LOD definition;
BSP partitions; structural database definition; and networking
protocols. Activation's features can be accessed by designers through 
point-and-click mouse commands, or by programmers through taggable code 
assigned to . . . 

...for all leading game platforms, including Sony Playstation, Sega Saturn, 
Nintendo 64 and PCs running Microsoft DOS or Microsoft Windows. Included 
with Activation are three sample games which illustrate the breadth of 
the program's prototyping capabilities. The sample games are Race to Los 
Gatos, a 3D racing game; Mythology Fight, a 3D fighting title; and Space 
Cadet, an action-oriented space battle game...
time graphic displays about transaction activity, CPU and device
utilization, and network traffic. 

The SYBASE Configurator (TM) product provides capacity planning and 
design modeling for database environments. The software analyzes user 
statistics, capacity requirements, application design information, and
throughput requirements, and then recommends hardware configurations, 
selects database partitioning and estimates performance. Such 
capabilities are especially important for sites with massive amounts of 
data and high transaction and query volumes. The initial release of 
Configurator is... 
1, 2, 5 and 10 years are employed.

Cash flow from operations and operating accruals data were generously 
supplied by Percy and Stokes (1992). Their sample comprised 107 firms for 
which the information required to calculate the cash flow from operations 
and operating accruals measures was obtained from the AGSM Annual Report 
File. This sample represents all of the firms existing in Australia from 
1975 to 1985 for which data are available on the AGSM database. The time
period is comparable with the second half of EHO's sampling period which
runs from 1976 to 1986. 

The ten year event window used in this study implies that a firm must 
have ten consecutive years... 
disincentives associated with retrospective payment systems (wage
loss). Those disincentives have been mentioned in the discussion of the
preceding paper.

Under the suggested system, the data base of disability claims
(excluding permanent total because of the small sample size) would be
partitioned according to major categories. The latter might be linked to 
the functioning of body organs. A distribution of lost work days would be 
estimated for . . . 
5) More specifically, households whose heads were self-employed,
retired, not working but not retired, or in the occupational category
"other" were eliminated from the data base. Next, demographic classes
were defined in order that the subsamples of households residing in a given
state could be partitioned. Twelve classes were defined on the basis of
marital status, education, and race. Any household that did not respond to
these questions was removed from the sample. These deletions left a total
of 9,242 households in 41 States to be partitioned. 

Once each state subsample had been partitioned, averages of 
disposable. . . 
An investigation of the coauthor graph.
ABSTRACT: The structure of coauthor graphs and the statistical validity
of the associated author partitions are investigated as a function of 
productivity and collaborative thresholds. The productivity threshold 
determines the number of authors (points) in a coauthor graph, and the 
collaborative threshold determines the number of coauthor pairs (lines) in 
the graph. The statistical validity of author partitions is determined
by the random-graph hypothesis. The results show that for 'small'
databases, statistically preferred partitions occur when all authors and
coauthor pairs appear in the graph. For 'large' databases, statistically
preferred partitions occur when authors and coauthor pairs who publish only 
one article are excluded from the graph. Unlike other bibliometric 
relationships, the highly... 

...the collaborative relationship produces a wide range of threshold values 
for which the associated partitions are statistically valid. It remains to 
be shown how the statistical validity of partitions is related to the 
empirical significance of the same partitions. (Reprinted by permission of 
the publisher.)
...TEXT: in this study were conducted in accordance with well-documented
and verified techniques (3,6,7,10). The following sections describe these
procedures.



SAMPLING 



The sample frame for the study consisted of all secondary solid wood 
products manufacturers in Louisiana. Examples of industry sectors 
represented include hardwood dimension and flooring mills, wood kitchen and 
bath cabinets, wood household furniture, wood office furniture, store 
fixtures, pallets, partitions, etc. There are estimated to be
approximately 650 companies in this population in Louisiana (12). The
primary source of sample frame information was existing industry 
directory databases and directories compiled by the LFPL (4). 

MAIL QUESTIONNAIRES

Data collection was done using a mail survey questionnaire. Mail 
questionnaires were chosen as the most... 
...TEXT: do not have incentives to forecast. Evidence regarding this
conjecture is presented in the association tests that follow. 

TABLE 3 

Looking at subsamples of the data based on prior-year earnings 
performance reveals that the price-based forecast outperforms the 
financial-statement-analysis-based forecast in both partitions of the
data. Even in the poor performance partition of the sample where
transitory earnings are more prevalent, the price-based forecast still
appears to generate more accurate predictions. In addition, all three 
forecast sources produce smaller errors... 
Research note: A study of computer usage and strategic planning in the SME
sector 

Bridge, John; Peel, Michael J 
WORD COUNT: 2180

...TEXT: planners were analysed for both small and medium-sized firms. 
Other than for accounting packages, the results revealed that high planners 
in the small company sample used the various software packages to a 
significantly greater extent than low planners, particularly in respect of 
spreadsheets, databases, MIS and statistical packages. The mean usage
scores for high planners in the small firm subsample for spreadsheets
(4.24), databases (4.06), MIS (3.39) and statistical packages (2.94)
were significantly higher than for low planners in the small company
subsample, who had mean scores of 3.84, 3.52, 2...
...TEXT: Barniv and Hathorn (1997) on mergers and insolvency.

Most life/health solvency studies have appeared after 1990 and also show a 
migration from matched-pairs samples to whole-industry analyses with the 
advent of the NAIC databases. Barniv and Hershbarger (1990) used
matched-pair sampling of pooled data from 1975 to 1985 to correctly 
classify the insolvency status of between eighty-two and ninety-one percent 
of life insurers one and two years in advance. More recently, Ambrose and 
Carroll (1994) used matched-pair sampling of pooled data from 1969 to 
1986 to predict life insolvencies for 1987 to 1991. They attribute their 
finding of relatively low predictive power to temporal changes in the 
factors responsible for insolvency over long time spans. 3 Using the NAIC 
database for 1986 through 1991, Carson and Hoyt (1995) compared logistic
regression, recursive partitioning, and discriminant analysis for
predicting life insolvencies. Although they did not analyze segments, they 
conjectured that "models segregated by insurer size and product line also 
may yield additional... 
...TEXT: pipeline parallelism. Collectively, the DB2 Family provides
additional support for data warehouse performance and scalability: 

Parallel-aware, cost-based search optimizers that exploit a wide range of 
database statistics 

Intelligent partitioning 

Parallel database operations, including (but not limited to) parallel table 
and index scans, joins, backup/recovery, and utilities 

Specialized indexes and index processing 

SQL extensions. . . 
ABSTRACT: In the relational database system, the join operation is one of
the most important due to its frequent uses, especially if relations are
normalized. Of several algorithms proposed for...

. . . can be saved, compared with the sort-based algorithm. The main 
improvement of the hybrid algorithm comes from completely sorting only the 
smaller relation and partitioning the others into ranged buckets 
according to the order statistics of the sorted relation. In analyzing 
the performance of the hybrid join and comparing it to other methods, it is 
shown that the hybrid join. . . 
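
The abstract above is truncated, so the following is only a minimal sketch of the general idea it describes: fully sort the smaller relation, take evenly spaced order statistics of its keys as bucket boundaries, and distribute the larger relation into ranged buckets that are then joined one at a time. The function name hybrid_join, the (key, payload) tuple layout, and the choice of four buckets are assumptions made for illustration, not details taken from the cited paper.

```python
import bisect


def hybrid_join(small, large, n_buckets=4):
    """Equi-join two lists of (key, payload) tuples.

    Hypothetical sketch: only the smaller relation is sorted; the larger
    one is merely scattered into key-range buckets.
    """
    small_sorted = sorted(small, key=lambda row: row[0])
    keys = [row[0] for row in small_sorted]
    if not keys:
        return []

    # Bucket boundaries are order statistics of the sorted relation.
    cuts = [keys[(i * len(keys)) // n_buckets] for i in range(1, n_buckets)]

    # Scatter the larger relation into ranged buckets (no sorting needed).
    buckets = [[] for _ in range(n_buckets)]
    for row in large:
        buckets[bisect.bisect_right(cuts, row[0])].append(row)

    # Each bucket only needs the matching slice of the sorted relation.
    edges = [0] + [bisect.bisect_left(keys, c) for c in cuts] + [len(keys)]

    joined = []
    for i, bucket in enumerate(buckets):
        lo, hi = edges[i], edges[i + 1]
        for key, payload in bucket:
            first = bisect.bisect_left(keys, key, lo, hi)
            last = bisect.bisect_right(keys, key, lo, hi)
            for _, small_payload in small_sorted[first:last]:
                joined.append((key, small_payload, payload))
    return joined
```

In a disk-based setting each bucket and its matching slice of the sorted relation would be brought into memory together; the in-memory lists here only stand in for that behavior.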
ABSTRACT: The structure of coauthor graphs and the statistical validity
of the associated author partitions are examined as a function of 
productivity and collaborative thresholds. The statistical validity of 
author partitions is determined by reference to the random-graph 
hypothesis. The results indicate that, for "small" databases,
statistically preferred partitions occur when all authors and coauthor
pairs appear in the graph. For "large" databases, statistically
preferred partitions emerge when authors and coauthor pairs who publish 
only one article are excluded from the graph. Unlike other bibliometric 
relationships, the highly. . . 

. . . the collaborative relationship generates a wide range of threshold 
values for which the associated partitions are statistically valid. It 
remains to be demonstrated how the statistical validity of partitions is 
related to the empirical significance of the same partitions. ... 
ABSTRACT: The sorting of large files of data derived from bibliographic or
other textual data bases can be an expensive procedure. Therefore, any 
slight increase in the efficiency of sorts can contribute to reduction in 
costs to the users of information services. One method of sorting large 
textual data bases by computer uses external storage and divides the 
range of sort-keys in a sample of data to be sorted into a fixed set of 
partitions. The partitions are composed of ordered key ranges, and an
incoming data stream is distributed into a series of bins according to the 
partition in which the key lies. The bins are then... 

... sort, to give an ordered file. The number of disc accesses needed 
depends on the manner in which the bins become filled and, thus, on 
statistics of the data. An experiment using an INSPEC data base 
suggests that this method of sorting is feasible and that it is possible to 
generate a partition set from a reasonably small sample of the data to be 
sorted. 
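
The method summarized in this abstract (derive ordered key ranges from a sample, distribute the incoming stream into bins by key range, then sort each bin and concatenate) can be sketched as follows. This is an in-memory illustration under assumed names and parameters (partition_sort, four bins, a sample of 50 keys); the external-storage handling and the INSPEC experiment from the abstract are not modeled.

```python
import bisect
import random


def boundaries_from_sample(sample, n_bins):
    """Return n_bins - 1 cut points at evenly spaced ranks of the sorted sample."""
    ordered = sorted(sample)
    return [ordered[(i * len(ordered)) // n_bins] for i in range(1, n_bins)]


def partition_sort(keys, n_bins=4, sample_size=50, seed=0):
    """Sort keys by scattering them into ordered key-range bins.

    Hypothetical sketch: the bin boundaries come from a random sample of
    the data, as the abstract suggests is feasible.
    """
    keys = list(keys)
    if not keys:
        return []
    rng = random.Random(seed)
    sample = rng.sample(keys, min(sample_size, len(keys)))
    cuts = boundaries_from_sample(sample, n_bins)

    # Distribute the incoming stream into bins by the range each key falls in.
    bins = [[] for _ in range(n_bins)]
    for key in keys:
        bins[bisect.bisect_right(cuts, key)].append(key)

    # Bins cover ordered key ranges, so sorting each bin and concatenating
    # them in order gives a fully ordered result.
    result = []
    for one_bin in bins:
        result.extend(sorted(one_bin))
    return result
```

How evenly the bins fill depends on how well the sample reflects the key distribution, which is the dependence on "statistics of the data" that the abstract mentions.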
ABSTRACT: A national cartographic data base is needed which can readily
produce computer mapping among organizations having much the same needs. 
Many data bases designed to support federal programs could be applied 
by other users. Such data needs to be made more accessible to possible 
users at all levels.... 

. . . which use topological data to describe polygonal features, such as city 
blocks, could also be used to map such things as land use or health
statistics. A topological data base would have wide applicability.
The capability to merge 2 cartographic data files into one data base 
has also been developed. Further work should also be done on merging a 
gridded data base with the topological data structure. A gridded data 
base is used in cases where data cannot be partitioned into exactly
defined boundaries, such as in the case of rainfall.
...ABSTRACT: re-engineering. Managers must perform extensive evaluations
of software, hardware and the overall data model before designing the ETL 
process. The next steps are to size the database and determine 
partitioning strategy. Building the actual prototype involves first
identifying its focus and then creating database objects using the models 
and tools selected. A prototype should be populated with a good-size 
sample of real data if possible. Generating reports and running ad-hoc 
queries helps test a warehouse. The final step is to digest the results 
from. . . 
... task in a similar fashion. When you launch either program, it
allocates as much memory as possible for a packet buffer. The buffer size
is determined by the size of the MultiFinder memory partition and can 
be as large as you want. When you start collecting packets, both programs 
display statistics and charts of the number of packets captured, errors 
found and the amount of network bandwidth being used. 
Packets are displayed in a main window. . . 
...ABSTRACT: The primary design objective of this controller is to
maximize the throughput of the signal processing module as well as
controller task timing and queueing statistics. The model is
parameterized to allow sensitivity analysis of functional components,
task partitioning, queue sizing and data input rates. (Reprinted by
Permission of Publisher.) 
Word Count: 2182

gave different results and hence "a specified procedure with 
pre-determined calibration curves has to be followed in order to obtain 
reliable and reproducible results". 

Samples of the PVC films were contacted with different 
concentrations of PVC in the water or oil (in the ranges 50-200, 30-105, 
and 20-50 ppb) and the system stirred until equilibrium was reached, when 
the liquid phase was analysed. VCM in the polymer was estimated by 
difference. 

Partition coefficients (polymer to liquid) ranged from ca 1 to 8
for the corn oil, and 6 to 40 for the water, increasing generally (not
invariably) with reducing VCM concentration.
The...
... homopolymer, or copolymer) after the polymerization. Incorporation
is favored at lower surfactant and higher initiator concentrations. These
results are not unexpected. 

SURFACE vs. BURIED: The sample prepared with 30 mM reactive 
surfactant and 8 mM (Na.sub.2)(S.sub.2)(O.sub.8) was subjected to further
analysis to determine the extent of partitioning of the surfactant 
between the surface and interior of the latex particles. By partially 
swelling the particles with THF, ion exchange and titration of the. . . 
... of more statistically sound measures (see Barber and Lyon (1997)
and Kothari and Warner (1997)). These better measures include the
size-adjusted returns we provide.

Size-adjusted returns are calculated by partitioning the
highly shorted sample and all Nasdaq stocks into market value deciles at
the time of each of the 55 announcements. Net-of-size portfolio returns are 
calculated as . . . 
... in the feed.

As it is virtually impossible to measure the flow rates of the feed
(undersize and oversize streams in real time operations), the partition
numbers for the various size fractions must be determined from sample
data gathered in a steady state for the three streams using an analytical
equation.

The undersize and oversize partition numbers are derived in Equations
2...
p.sub.j)((X.sub.ji)), (17)

where (X.sub.ji) are independent draws from (p.sub.j) and the
subscripts on I denote the partition of unity and the sample sizes 
used. The estimate (I.sub.n,w) is unbiased under mild conditions on the 
supports of the function (p.sub.j) and (w.sub.j). 

Veach and Guibas. . . 
and 20.7 percent in BH terms in the fourth year. (We do not use
four-year returns elsewhere in the paper.) 

Panel C considers sample partitions by size, book-to-market ratios, 
and time-period. The size partition is based on market capitalization 
measured at the time of the first financial statement post-IPO with
cutoffs at $20 million and $100 million (in December 1997 real dollars).
The differential. . . 
... sub.it) in response to a change of one standard deviation in the
value of this explanatory variable. A comparison of this change to the
size of the partitions provides a measure of the economic importance 
of a variable. As a further aid in interpreting the probit model, Table III 
contains descriptive statistics of the distributions of the explanatory 
variables by rating category and overall. 

The variance of the standard errors of the probit model, which can be 
... is provided in Section 3, Example 1.

2.4 The Power of the Test 

The power of the test depends on the accuracy of the partition 
boundaries as estimates of "true" boundaries, the difference in
intensities (Lambda) and (Lambda)(prime), and the size of the region of
nonhomogeneity (R.sup.A). In the example considered herein, the... 

...than the healthy tissue, then this partitioning scheme has the potential 
to yield greater power than a standard quadrat test or a conventional 
spatial scan statistic. The power of the test based on (2) can be
calculated exactly. In particular, we consider the best-case scenario in
which (R.sup.A...
the market value effects of advertising and R&D from other
industry-specific valuation considerations. In addition, the basic
valuation model can be analyzed over samples of advertising-intensive and
R&D-intensive industries to learn the extent to which the valuation effects
of advertising and R&D expenditures are mitigated by substantial 
promotional and innovative activity by competitors. By analyzing the 
overall sample of firms, in addition to a simple two-part breakdown for 
manufacturing versus nonmanufacturing firms, it becomes possible to learn 
the extent to which expenditures... 

...and R&D have broad rather than narrow implications for the value of the 
firm. By considering the market value implications of a three-part sample
partition according to firm size (measured by sales revenue), the
extent to which firm size plays a role in determining the market value 
effects of advertising and R&D can also... 
... that this condition guarantees uniqueness.

Andreatta and Kaufman (1986) adapt Murthy's (1957) estimator, a close 
relative of Horvitz and Thompson's estimator, to successive sampling in a 
different way. If any one population characteristic such as the number N of 
deposits, the sum of all deposit magnitudes or a fractile. . . 

...in-place deposits is assumed to be known with certainty, then this 
knowledge can be used to compute an estimate of inclusion probabilities from an
incomplete sample of the population. They call this "anchored 
estimation," the known population characteristic being the "anchor." An 
application to North Sea data partitioned into seven size classes 
recovers MLE estimates for each of these size classes so closely as to 
suggest a tight link between conditional (on the anchor) MLE and unbiased
estimation via anchoring... 
...TEXT: both subperiods, mimicked the earlier results. Thus, the changed
relationship between return and beta was not driven by a sector effect. 

Does size matter? To assess the potential confounding effect of size,
we partitioned stocks into three size groupings based on market 
capitalization in each June. The "large" stock group comprised the largest 
100 stocks; the "medium" group, the next largest 200 stocks; and the 
"small" group, the remaining stocks. The mean monthly return for the entire 
sample period varied significantly with size: Large stocks averaged 1.24 
percent; medium stocks averaged 1.07 percent; and small stocks averaged 
2.35 percent a. . . 
...TEXT: from studies that examined multiple products. When subjects or
dependent variables differed, we calculated separate effect size estimates 
across products. The independent variable used to partition studies for 
calculating the effect size estimates was ad format (comparative ad 
versus noncomparative ad). Comparative ads were those that explicitly or
implicitly compared the sponsor's brand with another brand in... 

... three ad levels (e.g., direct comparative, Brand X, noncomparative), we 
used direct comparative versus noncomparative ads to calculate the effect 
size estimate. The d-statistic was coded positive if the comparative ad
produced more favorable results than the noncomparative ad and negative
otherwise.

Moderating variables were included in our analysis... 
...TEXT: the New Zealand audit market, but the interactive nature of the
relationships among audit firm size, company size, and company listing
status requires a further analysis of the sample partitioned on
company size (large vs. small) and listing status (listed vs. unlisted).
The partitioned regression results demonstrate that the Big 5 received fee 
premiums from large listed and. . . 
...TEXT: size than the incumbent auditor, relative to clients that change
auditors with no disagreement disclosed. Table 8 presents data regarding 
the incumbent and new auditor partitioned by two different measures of 
size: (1) Big 8 versus Non-Big 8; and (2) total audited sales. (13) (Table
8 omitted) For the Big 8 versus Non-Big 8 classification... 

...to a Big 8 auditor (22 percent versus 11 percent), a chi sup 2 test on 
the two-by-two contingency table including the 28 sample firms changing 
to different size auditors indicates an insignificant difference between 
the switch behavior of the D group and the NO group. When the Big... 
...ABSTRACT: which use topological data to describe polygonal features,
such as city blocks, could also be used to map such things as land use or
health statistics. A topological data base would have wide applicability.
The capability to merge 2 cartographic data files into one data base has 
also been developed. Further... 

... be done on merging a gridded data base with the topological data 
structure. A gridded data base is used in cases where data cannot be 
partitioned into exactly defined boundaries, such as in the case of
rainfall.
Inc. (Nasdaq:CYRD) announced today that it has successfully tested
automatic image recognition features of its digital document processing 
software on a large pre-scanned sampling of diverse microfilm formats. 

The software's innovative statistical modeling algorithms 
automatically locate individual image boundaries on the scanned microfilm 
and partition the digital file into standardized pages, eliminating 
excess data. The company is developing an automatic image recognition, 
enhancement, and restoration solution that is essential for... 
firms from Canada; and
8) (m.sub.j) is the disturbance term for firm j in year t. 
Results 

Summary Statistics 

Table Al presents summary statistics of the measuring variables. 
Panel A provides summaries of variables for the total sample, while
Panels B, C, and D partition the statistics by region and year of 
study. From Panel A, the statistics reveal that the mean market reaction 
(INFOann) to foreign earnings announcement is 1.98 percent of the 
security's price and 1.04 percent during... 
encoded using only simple sufficient statistics, thus allowing
models to be updated without the need to rerun past data. Note that the 
number of sufficient statistics for each partition is a linear 
function of its cardinality, which is typically a small fraction of the 
sample size. Thus the notion of interruptibility can be applied when n is 
large. 



APPENDIX A: APPROXIMATE WEIGHTED CHINESE RESTAURANT ALGORITHM FOR 
SINGLE-MEASUREMENT DATA 
Plug. . . 



17/3, K/4 (Item 2 from file: 148) 

DIALOG (R) File 148: Gale Group Trade & Industry DB 
(c)2003 The Gale Group. All rts. reserv. 

15248362 SUPPLIER NUMBER: 94874445 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Branching and competition in the European banking industry. (econometric
analysis of cost of bank branching and competition in nine western
European nations)
Cerasi, Vittoria; Chizzolini, Barbara; Ivaldi, Marc 
Applied Economics, 34, 17, 2213(13) 
Nov 20, 2002 

ISSN: 0003-6846 LANGUAGE: English RECORD TYPE: Fulltext

WORD COUNT: 8875 LINE COUNT: 00830 

... branches in the sample is comparable to the OECD figures for
Belgium, Germany, and France, while, for the other countries, larger banks
are slightly oversampled.

Table 2 provides some statistics on the partition of the 
observations in the sample according to branching behaviour. It appears 
that, within the set of multi-branch banks, the majority (2565) opened new 
branches or kept their network size... 
... divide the sample into scanners and non-scanners. Firms scoring
above the mean were classified as scanners, those below the mean as
non-scanners.

The statistical methods used in testing the hypotheses were 
analysis of variance (ANOVA) and t-tests. ANOVA was used to partition the
sample into scanners and non-scanners on each scanning index across the
growth and maturity stages of the industry life cycle. (Growth and maturity 
were the . . . 
                                   n = 3 Years
Intercept ((alpha).sub.0)          -1.66 (***)
(t-statistic)                      (-22.55)
(FE.sub.t-n) ((alpha).sub.1)       0.12 (***)
                                   (10.09)
F Statistic                        102
Adjusted (R.sub.2)                 0.011
Sample Size                        9,411
(***) Significant at the 0.01 level.

Descriptive Statistics for Partitions of Lagged Residual Forecast 
Error (partitioned by thirds) 

(FE' . sub. j t-n) the forecast error for firm j (i.e., ( (E. sub. 1) - (F. sub 
... non-Januaries during 1975-84, and non-Januaries during 1985-92.

Results indicate market beta is not priced for Canadian stocks in any
of the sample partitions. None of the t-statistics is significant for
the beta risk premia. In fact, the risk premia for beta are negative
(although insignificant) for most of the partitions presented in...

...S. markets over various subperiods. However, they do not provide
contrasts between January and non- January. 

Firm size effects for Canadian stocks exist in each sample
partition presented in Table 3. All of the t-statistics for the
size-risk premia are significant. Findings indicate the firm size effect is
noticeably stronger in January than in non-January. For example, over
cross-sectional regressions and the respective t-statistics are
reported for each sample partition. The first column presents the
beta risk premia and respective t-statistics and the second column
presents the size risk premia and respective t-statistics.

** Significant at the 5 percent level. 
Our tests show firm size effects... 
1975-84, 1985-89, January only, and non-January 

months. The averages of the gammas estimated from the
cross-sectional regressions and the respective t-statistics are
reported for each sample partition. The book-to-market risk premia
and corresponding t-statistics are presented in the first column. The
size risk premia and corresponding t-statistics are presented in the
second column.

** Significant at the 5 percent... 
the radius (Sigma) of the median filter for larger target signals.
This independence assumption is key to the derivation in Section 2.2 of the
sampling distribution of the spatial scan statistic based on the
stochastic partition. The effect of this presmoothing on the partition
(W.sub.(Sigma))((Zeta)) is investigated in Section 3.1.

In summary, given a realization (Zeta), W((Zeta)) = {(R.sub.1),
(R.sub.K)} produces...
WORD COUNT: 94 50 LINE COUNT: 007 63

... it difficult for them to retain their position. Also bidders in
rejected offers are performing insignificantly better than the bidders
whose offers are accepted (t-statistic = 1.09).

Panel D of Table 2 partitions the sample according to the form of 
payment. It shows target firm's performance prior to the bid is more 
negative when equity is the form of...

...60 43 4 4

1986(*) 32 26 2 1 



Total 800 579 375 191 

* The number of 1986 issues is small because the Registered 
Offering Statistics tape available for this project does not 
include all 1986 registrations. 

TABLE 3 Sample Size of Each Portfolio Partition 

A. Common Stock Offers 

D Precedes B D follows B Row Total 

X precedes B Portfolio 1 Portfolio 2 

75 38 113 

(39.27%) (19... 
... Roley's analysis covers six years (1977-1982) with 27 changes, and
Hafer studies the 1977-1984 period with 32 changes. In addition, each study
partitions the sample into different monetary policy regimes. The
resulting small sample sizes diminish the chances of finding statistical
significance even if economic significance exists.

5 As indicated by Waud (1970), frequently the New York Fed Bank will 
either lead or lag changes made... 
... size quintile into quintiles according to UEARN. Second, in

quintiles according to UEARN then within each earnings quintile according
to SIZE. For each of these partitions we use a statistic based upon
Jonckheere [1954] which tests for a k-sample trend against ordered
alternatives. (9) For a given size partition, we expect abnormal returns
to increase as unexpected earnings increases leading to a significant
positive TJ statistic across earnings quintiles. These results are
presented in...
...TEXT: for this purpose, and this forces us to use cross-sectional data
to quantify the age-related differences in inequality.

Specifically, we do the following: we partition the SCF sample into 10 
cohorts according to the age of the household heads, we compute the 
relevant statistics for each cohort, and we compare them with the 
corresponding statistics for the entire sample. These statistics are the 
cohort average earnings, income, and wealth... 
...TEXT: and growth are systematically invoked by analysts when summarizing
the investment potential of stocks.

In Panel B of Table 3, the second and third columns partition the sample
based on whether a target price is disclosed, and the final two columns
present a chi sup 2 statistic and p-value for whether the distribution of
each justification differs between the two subsamples. Generally, there are
two significant differences between the reports that...
...TEXT: non-Big 6 auditors. The data also indicate that the specialists
audit the larger insurers (74.6 percent of net premiums written).

Table 2 reports sample descriptive statistics by audit firm type.
Univariate t-tests, Chi-square tests, or Wilcoxon rank-sum tests are
performed across the partitions as a preliminary analysis. Table 2
indicates that the variable of interest OUTBD, measured as the percentage
of outside directors on the board of directors...
ABSTRACT: Is the Ewens distribution the only one-parameter family of
partition structures where the total number of types sampled is a
sufficient statistic? In general, the answer is no. It is shown that all
counterexamples can be generated via an urn scheme. The urn scheme need
only satisfy...
...TEXT: and nonstressed firms (replicating Hopwood et al.) and between
firms undergoing recession and those not undergoing recession.

This result is further refined by dividing the sample into the four
partitions described earlier and testing for incremental improvements
from stress and recession knowledge. The lambda statistic for the
difference between the unconditioned and the recession- and stressed-
controlled model -2LogL measures is 126.783 (24 df), and is also
statistically significant... 
...TEXT: the sample median and zero otherwise. Untabulated findings reveal
a significantly positive coefficient on SALESDEPR-c * s estimate is 173.34
with a White t-statistic of 5.65.

To use Barth (1994) for illustrating the diagnostic, we partition Barth's 
1989 sample based on a scale proxy, book value of equity for the 
investment securities regressions, and net income for the securities gains 
and losses regressions. Regression... 
...TEXT: and college training) were not included in any of the final models
because of insufficient data, or were eliminated because the analyses
showed no apparent statistical relationship at any reasonable level for
any factor or respondent subset. Area of employment is used to partition
the sample into subgroups for further analysis. Variable X sub 5*6,
which we label "employer support", is the interaction between encouragement
and training. Model parameters are...
...TEXT: group omega sub i if x is in the region Omega sub i. For the
financial distress case, the estimation problem is to identify a
statistical model that best partitions the sample space of companies'
financial ratios into the two groups: FIC and nonFIC.

We also confine ourselves to the Bayes minimal risk decision rule, which is 
...TEXT: tests do reject the hypothesis that the coefficients are the same
in the two sets of countries and the level of real GDP chosen to partition 
the sample was that which maximized the F statistic in this test. (10) 
These results suggest that the significantly negative impact of a larger 
government share is confined to the high income countries. For... 
...ABSTRACT: based on the use of binary search trees for tree partitioning.
With the new method, incoming files are decomposed into partitions with 
binary trees until partitions reach a manageable size for internal 
sorting. The search tree is generated by deriving statistics from a 
small sample of the data to be sorted. The method should be applicable to 
any data characterized by some degree of regularity, such as bibliographic 
and natural ... 
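
The sorting abstract above describes splitting incoming data on keys derived from a small sample until each partition is small enough to sort in memory. The following is a minimal sketch of that general idea, not the cited method itself. The function name, the size thresholds, and the use of a sorted pivot list searched by bisection (standing in for the binary search tree) are all assumptions made for illustration.

```python
import bisect
import random


def sample_partition_sort(records, max_size=4, sample_size=5, seed=0):
    """Sort a list by recursively partitioning it on sampled split keys."""
    if len(records) <= max_size:
        return sorted(records)  # partition is small enough for an internal sort

    # Draw a small sample and use its distinct values as split keys.
    rng = random.Random(seed)
    sample = rng.sample(records, min(sample_size, len(records)))
    pivots = sorted(set(sample))

    # Bucket i receives records r with pivots[i-1] < r <= pivots[i].
    buckets = [[] for _ in range(len(pivots) + 1)]
    for r in records:
        buckets[bisect.bisect_left(pivots, r)].append(r)

    out = []
    for b in buckets:
        if len(b) == len(records):
            # The sample failed to split the data (e.g. all keys equal).
            return sorted(records)
        out.extend(sample_partition_sort(b, max_size, sample_size, seed))
    return out
```

Because the buckets are ordered by key range, concatenating the sorted buckets yields a fully sorted result. How well the partitions balance depends on how representative the sample is, which matches the abstract's remark about data with some degree of regularity.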
English Abstract

A training database (including data mining algorithm descriptions and
metafeatures characterizing probability density functions of features) in
the memory and computer readable program code (i) to extract features
that classify data, (ii) to calculate metafeatures describing the case
probability density function, and (iii) to select a data mining algorithm
by using the training database to map the calculated metafeatures
describing the case probability density function to the selected data
mining algorithm. The frequency of the occurrence of features with respect
to datum in the data defines a case probability density function.

French Abstract 

L'invention concerne une base de donnees d'entrainement (comprenant des
descriptions et des meta-elements d'algorithme d'exploration en
profondeur de donnees caracterisant des densites de probabilite
d'elements) logee en memoire, et un code de programme lisible par
ordinateur destines a: (i) extraire des elements de classement des
donnees; (ii) calculer des meta-elements decrivant la densite de
probabilite du cas; (iii) choisir un algorithme d'exploration en
profondeur de donnees en utilisant la base de donnees d'entrainement pour
mapper les meta-elements calcules decrivant la densite de probabilite du
cas relativement a l'algorithme d'exploration en profondeur de donnees
choisi. La frequence d'occurrence des elements par rapport a la reference
dans les donnees definit une densite de probabilite du cas.

Legal Status (Type, Date, Text) 

Publication 20020919 A1 With international search report.
Examination 20030109 Request for preliminary examination prior to end of
19th month from priority date
Detailed Description

Detailed Description 

this embodiment may also identify a point of diminishing returns in 
the number of features and estimate feature robustness. The computer 
readable program code to estimate feature robustness may also 
partition the data into subsets, temporally, sequentially, randomly, or 
otherwise. The computer readable program code to estimate feature 
robustness in this embodiment may then calculate the entropy of each 
subset as a statistical measure of similarity. The computer readable
program code in this embodiment may also identify parameters (such as 
user preferences, real-time deployment issues, available memory. . . 
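
The excerpt above describes partitioning data into subsets (sequentially, randomly, or otherwise) and computing the entropy of each subset as a similarity measure. A minimal sketch of that step follows. The function names, the equal-size chunking, and the choice of Shannon entropy in bits over discrete feature values are assumptions; the excerpt does not say which entropy definition the embodiment uses.

```python
import math
import random
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy (in bits) of the empirical distribution of values."""
    counts = Counter(values)
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def partition(data, k, mode="sequential", seed=0):
    """Split data into about k near-equal subsets, in order or after a shuffle."""
    items = list(data)
    if not items:
        return []
    if mode == "random":
        random.Random(seed).shuffle(items)
    size = math.ceil(len(items) / k)
    return [items[i:i + size] for i in range(0, len(items), size)]


def subset_entropies(data, k, mode="sequential"):
    """Entropy of each subset; similar values suggest a feature is stable."""
    return [shannon_entropy(s) for s in partition(data, k, mode)]
```

Under these assumptions, a feature whose subset entropies are close to one another behaves consistently across the partitions, which is one way to read "feature robustness" in the excerpt.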
A method of managing storage in a document image database (14) using
document analysis (32) to partition documents into logical regions
and modified by reducing storage size of the regions using different
reduction modifiers according to various storage preference rules (78).
Storage preference rules are intended to maintain high quality 
representations of important document information while reducing storage 
requirements at the expense of lesser important aspects of the document. 
In particular, the different reduction modifiers (34) applied to stored 
document images include reducing sampling depth, reducing sampling 
resolution based on minimum font size, utilizing lossy and lossless 
compression schemes and discarding unimportant regions of document image. 
Over time, document analysis and modification can be repeated to further 
reduce the storage size of previously stored data files (50, 52, 54). 

French Abstract 

L'invention concerne un procede permettant de gerer la memoire d'une
base de donnees d'imagerie documentaire (14) en utilisant l'analyse de
documents (32). Celle-ci permet de diviser les documents en regions
logiques et de les modifier en reduisant leur volume de memoire. Cette
derniere operation se fait a l'aide de modificateurs de reduction et
selon differentes regles preferentielles de stockage (78). Ces dernieres
ont pour but de conserver la haute qualite des representations de
documents importants tout en reduisant les exigences de stockage aux
depens de certains aspects des documents qui presentent une moindre
importance. En particulier, les differents modificateurs de reduction
(34) appliques aux images stockees comprennent la reduction de la densite
d'echantillonnage, la reduction de la resolution d'echantillonnage basee
sur la police minimum, l'utilisation de procedes de compression a pertes
et sans pertes et l'exclusion des regions des documents qui ne presentent
pas d'importance. Par la suite, l'analyse de documents et la modification
peuvent etre repetees afin de reduire a nouveau le volume de memoire des
fichiers precedemment stockes (50, 52, 54).

English Abstract 

A method of managing storage in a document image database (14) using
document analysis (32) to partition documents into logical regions 
and modified by reducing storage size of the regions using different 
reduction modifiers according to various storage preference rules (78). 
Storage. . . 

...requirements at the expense of lesser important aspects of the document. 
In particular, the different reduction modifiers (34) applied to stored 
document images include reducing sampling depth, reducing sampling 
resolution based on minimum font size, utilizing lossy and lossless 
compression schemes and discarding unimportant regions of document image. 
Over time, document analysis and modification... 
ABSTRACT EP 992909 A2

A method and apparatus for distributing computer resources in a network
environment. A network of computer systems is partitioned into at least
one computing system partition, and is configured into at least one
redundancy group. The computing system partitions include
applications, computing system nodes, and copies of a database
schema. The copies of the database schema are replicated at each
computing system partition within a network. The computing system
partition manages interactions between the instances, the computing
system nodes, and the copy of the database schema within the
respective computing system partition. The redundancy group comprises
at least one computing system and a plurality of computing system
partitions, and manages the replication of the database schema
within the computing system and computing system partitions.
at least one computing system partition, including at least one
instance of an application, at least one computing system node, and
at least one copy of a database schema, the copies of the
database schema being replicated at each computing system
partition within a network, and wherein each computing system
partition manages interactions between the instances, the computing
system nodes, and the copy of the database schema within the
respective computing system partition;

a plurality of computing systems connected together via the network,
wherein each computing system comprises one or more computing system
partitions;

at least one redundancy group, comprising at least one computing system
and a plurality of computing system partitions, wherein each
redundancy group manages the replication of the database schema
within the computing system and computing system partitions
within the redundancy group.
2. The system of claim 1, wherein the redundancy group defines a first
computing system as the computing system that replicates...

...10. A method for distributing computer resources in a network 
environment, comprising the steps of: 

assembling, as part of a computer network, at least one computing
system partition, including at least one instance of an
application, at least one computing system node, and at least one 
copy of a database schema, the copies of the database schema 
being replicated at each computing system partition within the 
computer network; 

configuring, within the computer network, a plurality of computing 
systems connected together via the computer network, wherein each 
computing system comprises... 

...method of claim 10, wherein the task is a database replication within
the computer network.

12. The method of claim 11, wherein the task of database replication
is performed by a first computing system partition within the
redundancy group.

13. The method of claim 12, wherein the task of database replication
is performed by a second computing system partition within the
redundancy group when the first computing system partition is
unavailable.
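
Claims 12 and 13 above describe the replication task passing from a first computing system partition to a second one when the first is unavailable. A minimal sketch of that selection rule follows. The function name, the priority-ordered list, and the availability map are hypothetical and are not taken from the cited application.

```python
def pick_replicator(partitions, available):
    """Return the first available partition in priority order, else None.

    partitions: partition names, highest priority first (assumed ordering).
    available:  dict mapping partition name -> bool (assumed status source).
    """
    for name in partitions:
        if available.get(name, False):
            return name
    return None
```

With both partitions up, the first one performs replication; if the first is marked unavailable, the second takes over.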

14. The method of claim 10, wherein the redundancy group can be redefined
to include a different set of computing systems.

15. The...

...method for providing database access, comprising the steps of: 

operating at least one computing system within a network, the computing
system containing at least one computing system partition and the
computing system being a member of a redundancy group, wherein the
computing system partition includes at least one instance of an
application, at least one computing system node, and at least one
copy of a database schema, the copies of the database schema
being replicated at each computing system partition within a
network; and

managing the replication of the database schema within the
computing system and computing system partitions within the
redundancy group.
systems.

...ABSTRACT outage on a computer network. The apparatus comprises at least
one redundancy group comprised of one or more computing systems,
comprised of one or more computing system partitions. The computing
system partition includes copies of a database schema that are
replicated at each computing system partition. The redundancy group
monitors the status of the computing systems and the computing system
partitions, and assigns a task to the computing systems based on the
monitored status of the computing systems.

...SPECIFICATION in a failure recovery system, characterized by: 

one or more computing systems connected together via a network,
wherein each computing system comprises one or more computing system
partitions each including at least one copy of a database schema,
the copies of the database schema being replicated at each
computing system partition within a network;

at least one redundancy group comprised of the computing systems and
the computing system partitions, wherein each redundancy group monitors a
status...

...from a computer failure, characterized by the steps of: 

operating one or more computing systems within a network, the 
computing systems comprising one or more computing system partitions 
each including at least one copy of a database schema, the copies 



...accordance with the present invention comprises at least one redundancy
group comprised of one or more computing systems, which are comprised of
one or more computing system partitions. The computing system
partition includes copies of a database schema that are replicated
at each computing system partition. The redundancy group monitors the
status of the computing systems and the computing system partitions,
and assigns a task to the computing systems based on the monitored
status of the computing systems.

The foregoing description of the preferred embodiment of... 

CLAIMS 1. A failure recovery system, characterized by: 

one or more computing systems connected together via a network, wherein 
each computing system comprises one or more computing system 
partitions each including at least one copy of a database 
schema, the copies of the database schema being replicated at 
each computing system partition within a network; 

at least one redundancy group comprised of the computing systems and the 
computing system partitions, wherein each redundancy group monitors a 
status. . . 

...from a computer failure, characterized by the steps of: 

operating one or more computing systems within a network, the computing 
systems comprising one or more computing system partitions each 
including at least one copy of a database schema, the copies of 
the database schema being replicated at each computing system 
partition within a network; 

configuring the computing systems into at least one redundancy group; 

monitoring a status of the computing systems and the computing system 
partitions within the redundancy group; and 

assigning. . . 

...computer network, characterized by the steps of 

operating one or more computing systems within the computer network,
wherein the computing system includes at least one computing system
partition, the computing system partition having at least one
copy of a database schema;
configuring the computing systems together via the computer network; 
configuring, within the computer network, at least one redundancy group, 
comprising one or more computing... 
...SPECIFICATION support a greater load. In this case, the RDBMS access
partition should run on each node where an RDBMS engine is located. Forte
supports the replication of this RDBMS access partition and provides
a router to allow multiple RDBMS engines to service the next RDBMS
request in the application queue. The other reason for...

...back-up node. For example, a node may provide a key analytical service
or image for the user. In these cases it is possible to define a
replicated partition as an alternate node that can be accessed in the
event that the primary node is unavailable. Forte also provides a router
that can access...
Detailed Description

Claims 

Fulltext Word Count: 5332 
English Abstract 

Systems and methods for automatically replicating database information. A 
subscription database (110) is queried by a database replication server 
(120) to obtain custom attributes defined in a plurality of custom 
database information subscriptions (115). The custom attributes for each
database information subscription (115) include: (a) the identification
of one or more master subscription databases (130), and (b) at least one
operation to be performed on the one or more master subscription databases
to create a custom information database. The operation(s) to be performed
can include merging database information contained in records (135) from
two or more master subscription databases (130), partitioning database
information contained in one or more master subscription databases (130).
The operations identified by the custom attributes for each custom 
database information subscription (115) are used to automatically 
generate custom information databases (140) containing preferred database 
information from the master subscription databases (130). 

French Abstract 

L'invention concerne des systemes et des procedes permettant de repliquer
automatiquement des informations de bases de donnees. Un serveur de
replication (120) de base de donnees demande a une base de donnees
d'abonnement (110) d'obtenir des attributs personnalises definis dans une
pluralite d'abonnements d'informations (115) de base de donnees
personnalisee. Les attributs personnalises de chaque abonnement
d'informations (115) de base de donnees comprennent a) l'identification
d'au moins une base de donnees (130) d'abonnement maitre, et b) au moins
une operation a executer sur la base de donnees d'abonnement maitre afin
de creer une base de donnees d'informations personnalisee. La ou les
operation(s) a effectuer consistent a fusionner les informations de base
de donnees contenues dans des enregistrements (135) a partir d'au moins
deux bases de donnees (130) d'abonnement maitres, et a partitionner les
informations de base de donnees contenues dans la base de donnees (130)
d'abonnement maitre. Les operations identifiees par les attributs
personnalises pour chaque abonnement d'informations (115) de base de
donnees personnalisee sont automatiquement utilisees pour creer des bases
de donnees (140) d'informations personnalisees contenant des informations
de base de donnees preferees provenant des bases de donnees (130)
d'abonnement maitres.

Legal Status (Type, Date, Text) 

Publication 20020117 A2 Without international search report and to be
republished upon receipt of that report.
Examination 20020801 Request for preliminary examination prior to end of
19th month from priority date

Main International Patent Class: G06F-017/00 
Fulltext Availability: 

Detailed Description 

Claims 

Detailed Description 

Next, in Step 370, it is determined from the custom attributes whether
the custom database information subscription requires a partition
operation. If so, a partitioned copy of the identified Master
Subscription Database 130 is created (Step 375). Regardless of the need
for partitioning, it is also determined whether the custom database
information subscription requires a merging operation (Step
380). If so, the identified databases are merged (Step 285); the merged
databases can...

...can also identify one or more filtering operations to be performed on a
database (Step 0 380). If a filtering operation is identified, the local
copy of the identified Master Subscription Database 130 is filtered
using the specified criteria (Step 395); the resulting database can be
stored in a local database and/or in a publication database...
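
The steps above apply optional partition, merge, and filter operations to master subscription databases according to each subscription's custom attributes. The sketch below illustrates that control flow only. The record layout (lists of dicts), the subscription keys, and the reading of "partition" as keeping a chosen subset of fields are all assumptions; the excerpt does not define them.

```python
def build_custom_database(masters, subscription):
    """Apply partition, merge, and filter steps in the order the text gives.

    masters:      dict mapping database name -> list of record dicts (assumed).
    subscription: dict with hypothetical keys "masters", "partition_fields",
                  "merge", and "filter".
    Returns a list of result tables, one per source unless merged.
    """
    # Work on local copies so the master databases are left untouched.
    sources = [list(masters[name]) for name in subscription["masters"]]

    # Partition step: keep only the requested fields of each record.
    fields = subscription.get("partition_fields")
    if fields:
        sources = [[{f: rec[f] for f in fields if f in rec} for rec in src]
                   for src in sources]

    # Merge step: combine all identified databases into one.
    if subscription.get("merge"):
        sources = [[rec for src in sources for rec in src]]

    # Filter step: keep records that satisfy the specified criteria.
    keep = subscription.get("filter")
    if keep:
        sources = [[rec for rec in src if keep(rec)] for src in sources]

    return sources
```

Each step is skipped when its attribute is absent, mirroring the "it is determined whether the subscription requires" checks in the description.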

Claim 

... of said
custom database information subscriptions by performing said at least one
operation on said database information stored in said one or more master
subscription databases.

10. The database replication server recited in Claim 9, wherein said
merging, partitioning and filtering operations are definable on a
record or record field basis.

11. The database replication server recited in Claim 9, wherein said act
of generating said custom information database...
Claims

Fulltext Word Count: 17030 
English Abstract 

To facilitate accurate document searching, electronically accessible 
documents are provided with abstracts written in a highly constrained 
artificial grammar. The artificial grammar is capable of expressing the 
thoughts and information ordinarily conveyed in a natural grammar, but in 
a structured format that restricts the number of possible alternative 
meanings. Accordingly, while the grammar is clear in the sense of being 
easily understood by native speakers of the vocabulary and complex in its 
ability to express sophisticated concepts, sentences are derived from an 
organized vocabulary according to fixed rules. A query, preferably 
formulated in accordance with these rules, is employed by a search engine 
in the usual fashion. Due to the highly constrained meaning of the search 
query, and the likelihood that relevant documents have similar or 
matching abstracts in their headers, key-word searches are likely to 
identify the most relevant documents. 

French Abstract 

Afin de faciliter la recherche de documents, les documents
electroniquement accessibles sont pourvus de resumes rediges dans une
grammaire artificielle extremement comprimee. La grammaire artificielle
est capable d'exprimer la pensee et l'information normalement exprimee
par la grammaire naturelle, mais dans un format structure qui restreint
le nombre de significations alternatives possibles. Par consequent,
tandis que la grammaire est claire dans le sens qu'elle est facilement
comprehensible par les locuteurs natifs du vocabulaire et complexe dans
sa capacite d'exprimer des concepts sophistiques, les phrases proviennent
d'un vocabulaire organise selon des regles fixes. Une requete, de
preference formulee selon lesdites regles, est utilisee par un moteur de
recherche de la maniere habituelle. En raison du sens extremement
comprime de la requete, et la possibilite que des documents pertinents
aient des resumes similaires ou analogues dans leur en-tete, la recherche
de mots-cles est susceptible d'identifier les documents les plus
pertinents.
Detailed Description

... and the COMMUNICATOR and NAVIGATOR products
supplied by Netscape Communications Corp.

To support analysis module 225 (if included), main memory 204
may also include a partition defining a series of databases capable of
storing the linguistic units of the invention; these are representatively
denoted by reference numerals 2351, 2352, 2353, 2354...

... columns-the first containing the linguistic unit, the second containing a
definition (if the linguistic unit has more than one meaning and is
therefore replicated in the database), and the third containing a .



synonyms . 

An input buffer 240 receives from the user, via keyboard 210, an
input sentence. Analysis module 225 examines the... 
Claims
English Abstract

An information storage system (100) includes a controller (116) for 
managing the resources of a common mass storage device (128) in order to 
enable multiple hosts (104, 104) connected to a common bus (106) to 
independently read and write to the mass storage device (128) in a 
relatively high speed manner on a first come, real time basis. In 
particular, a system of commands is provided which enables each host 
(104) to read and write to the mass storage device (128) on an 
independent, first come, real time basis by locking the requested address 
space irrespective of the origination. Even though an address storage
space may be locked, the data within such space is always readable by 
another host (104). Should a subsequent host (104) issue a command to 
write to the locked address space, the command is aborted and a flag is 
set indicating to the subsequent requesting host (104) that the area is 
locked. 
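
The locking behaviour in the abstract above can be sketched as follows: a host locks an address range on a first come basis, reads are always allowed, and a write by another host into a locked range is aborted with a flag set. The class and method names, the per-address owner table, and the dictionary returned by write are assumptions made for illustration, not the patent's command set.

```python
class SharedStore:
    """Toy model of a mass storage device shared by several hosts."""

    def __init__(self, size):
        self.cells = [0] * size
        self.owner = [None] * size  # host holding the lock on each address

    def _blocked(self, host, start, length):
        # True if any address in the range is locked by a different host.
        return any(self.owner[a] not in (None, host)
                   for a in range(start, start + length))

    def lock(self, host, start, length):
        if self._blocked(host, start, length):
            return False  # another host got there first
        for a in range(start, start + length):
            self.owner[a] = host
        return True

    def unlock(self, host, start, length):
        for a in range(start, start + length):
            if self.owner[a] == host:
                self.owner[a] = None

    def read(self, host, start, length):
        # Locked data stays readable by every host.
        return self.cells[start:start + length]

    def write(self, host, start, values):
        if self._blocked(host, start, len(values)):
            return {"ok": False, "locked": True}  # command aborted, flag set
        self.cells[start:start + len(values)] = values
        return {"ok": True, "locked": False}
```

In this sketch the "flag" is simply the locked field of the returned status; a real controller would report it through its own command interface.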

French Abstract 

Un systeme de memorisation d'informations (100) comprend un controleur
(116) pour gerer les ressources d'une memoire de masse commune (128) afin
de permettre a un grand nombre d'hotes (104, 104) connectes a un bus
commun (106) de lire et d'ecrire independamment dans la memoire de masse
(128), avec une vitesse relativement elevee, en temps reel, et selon un
systeme de premier arrive. En particulier, un systeme d'instructions est
prevu qui permet a chaque hote (104) de lire et d'ecrire dans la memoire
de masse (128), selon un systeme en temps reel de premier arrive, en
verrouillant l'espace d'adressage demande quelle que soit son origine.
Meme si un espace d'adressage peut etre verrouille, les donnees contenues
dans cet espace peuvent toujours etre lues par un autre hote (104).
Lorsqu'un autre hote (104) emet une instruction pour lire l'espace
d'adressage verrouille, l'instruction est avortee et un drapeau est place
pour indiquer a l'autre hote (104) que la zone est verrouillee.
Fulltext Availability:
Detailed Description

Detailed Description
... e.g.,
audit space 146, data space 148, keys space 150, swap space 152 and pads
space 154). After the pointers for the partitions 142 and 144 are
determined, additional copies of the new database address table are
stored in various protected memory storage areas, for example, one or
both of the special purpose storage spaces 156 in step 532...
ABSTRACT EP 800135 A1

There is disclosed a method and apparatus for controlling access to and
corruption of information in a computer system. In known "PC Virus" 
protection methods the boot partition becomes "Read Only" when the system 
is in Supervised Mode. However, Microsoft Windows, although not strictly 
self-modifying, does require that certain files located within the 
Windows directory, can be written to. Accordingly the present invention 
provides a method of controlling access to and modification of 
information stored on a storage medium forming part of a computer system 
comprising: dividing information stored on the storage medium into a 
plurality of non-overlapping partitions including a boot partition and at 
least one general partition, characterised by: designating at least one 
of said partitions a Write Many Recoverable (WMR) partition wherein, in 
use, if a write command is issued to overwrite any resident information 
stored in a/the WMR partition by updating information is written on the 
storage medium in a location other than where the resident information is 
stored and a (virtual) pointer to the updated information is set up/kept 
so that the updated information can be accessed, as required during a 
remainder of a session.
...CLAIMS a Sector Relocation Table (SRT) associated with it which table is
held in a Random Access Memory (RAM) of the Supervisor, each entry in a
SRT defining the address of a range of sectors in the WMR
partition that have been updated and an address where the updated
information is located, this location being within the dedicated
area.
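
The redirect-on-write behaviour described in the abstract and in the SRT claim excerpt above can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the class name `WMRPartition`, the per-sector (rather than per-range) table entries, and the `end_session` reset are all assumptions made for the example.

```python
class WMRPartition:
    """Toy model of a Write Many Recoverable partition (hypothetical names)."""

    def __init__(self, resident, dedicated_start):
        self.medium = dict(resident)      # sector address -> data on the medium
        self.srt = {}                     # Sector Relocation Table, kept in RAM
        self.next_free = dedicated_start  # next free sector in the dedicated area

    def write(self, sector, data):
        # Resident data is never overwritten: the update goes to the
        # dedicated area and the SRT records where it was placed.
        target = self.srt.get(sector)
        if target is None:
            target = self.next_free
            self.next_free += 1
            self.srt[sector] = target
        self.medium[target] = data

    def read(self, sector):
        # Follow the SRT pointer if the sector was updated this session.
        return self.medium[self.srt.get(sector, sector)]

    def end_session(self):
        # Assumption: discarding the RAM-held table exposes the resident data again.
        self.srt.clear()
```

A write to sector 1 followed by a read returns the updated data for the rest of the session, while the resident copy at sector 1 stays untouched on the medium.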

9. An apparatus for controlling...

A software partitioning tool is disclosed. Based on a visual display of
an application that shows program objects and the connections or 
interactions between the objects, an internal representation of the 
application is defined. As the user interacts with the visual display of 
the application, creating new partitions and relocating program objects 
in the new partitions, the internal representation of the display is 
constantly updated. Once a connection between program objects crosses 
partition boundaries, it is redefined in the internal representation as a 
distributed interaction (a connection), and representative server and 
client stubs are defined. At a number of points, the user also has the 
opportunity to set middleware protocols. Once the user commits to a 
distribution design, a code generator in the tool generates the actual 
server and client stubs for all distributed connections based on the 
definitions in the internal representation of the application.
...CLAIMS program object parts and connections between the objects;
defining an internal representation of the displayed application;
in response to user action, displaying at least one partition
boundary and defining said at least one partition boundary in
the internal representation;
in response to user action, relocating, on the displayed application, at
least one program object so that its connection with other program
objects cross said at least one partition boundary and defining
said connections as distributed connections in the internal
representations;
determining from said distributed connections server objects and client
objects; and
in response to a user...
...comprising the computer-implemented steps of: 

initially defining an internal representation of the objects and 

connections of the displayed application design; 
in response to user definition of at least one partition boundary 

in the displayed application design, defining corresponding empty 

partition containers in the internal representation; 
in response to user relocation of at least one program object across 

said at least one partition... 

...visual representation of connections between the objects, the tool 
comprising:

a metadata generator for defining a current internal representation of 
the displayed application and for defining any connections crossing 
partition boundaries in said displayed application as distributed 
connections in the current internal representation; and 

a code generator for generating distributed interfaces for all 
distributed connections defined... 

...internal representation of the objects and connections of the displayed 
application design; 

computer readable program code means for causing the computer, in 

response to user definition of at least one partition boundary 
in the displayed application design, to define corresponding empty 
partition containers in the internal representation; 

computer readable program code means for causing the computer, in... 



27/5, K/11 (Item 11 from file: 348) 

DIALOG (R) File 348: EUROPEAN PATENTS 
(c) 2003 European Patent Office. All rts. reserv. 

00378012 

Partitioning of sorted lists for multiprocessor sort and merge. 
Verteilung geordneter Listen zur Mehrprozessor-Sortierung und zum -Mischen. 
Decoupage de listes triees pour le tri et la fusion par plusieurs 
processeurs . 

PATENT ASSIGNEE: 

International Business Machines Corporation, (200120), Old Orchard Road, 
Armonk, N.Y. 10504, (US), (applicant designated states: DE;FR;GB) 
INVENTOR: 

Iyer, Balakrishna Raghavendra, 39503 Ross Common, Fremont California 
94538, (US) 

Ricard, Gary Ross, 607 Tenth Street S.E., Rochester Minnesota 55904, (US) 
Varman, Peter Joseph, 7530 Bromton, 823, Houston Texas 77025, (US) 

LEGAL REPRESENTATIVE: 

de Pena, Alain et al (15151), Compagnie IBM France Departement de 
Propriete Intellectuelle, F-06610 La Gaude, (FR) 

PATENT (CC, No, Kind, Date) : EP 378038 A2 900718 (Basic) 

EP 378038 A3 910502 

APPLICATION (CC, No, Date): EP 89480181 891206; 

PRIORITY (CC, No, Date): US 297634 890113 

DESIGNATED STATES: DE; FR; GB 

INTERNATIONAL PATENT CLASS: G06F-007/36; G06F-009/46; G06F-015/16;
CITED REFERENCES (EP A):

I.E.E.E. TRANSACTIONS ON COMPUTERS vol. 37, no. 12,
December 1988, pages 1619-1626, New York, US; R.S. FRANCIS et al.: "A
Benchmark Parallel Sort for Shared Memory Multiprocessors"

THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS AND COMMUNICATION
ENGINEERS OF JAPAN vol. E69, no. 9, September 1986, pages
996-1001, Tokyo, JP; S. HORIGUCHI et al.: "A Parallel Sorting Algorithm
for a Linearly Connected Multiprocessor"

IBM TECHNICAL DISCLOSURE BULLETIN vol. 31, no. 1, June
1988, pages 383-388, Armonk, New York, US; "Efficient Parallel
Quicksort Using Fetch-and-Add in Multi-Processor Computing Systems";

ABSTRACT EP 378038 A2 

Any number of sorted lists are efficiently partitioned into P lists, 
where P represents the number of processors available to sort the 
resulting lists. When given a large list to sort, the list is initially 
divided into P lists, and each processor sorts one of these lists. The 
lists are then exactly partitioned so that each of the elements in the 
new consecutive partitioned lists have values no smaller than any of the 
elements in the lists before it, nor larger than any of the elements in 
the list following it. Partitioning is done by P-1 processors. Each of 
the processors successively considers selected rows of elements from the 
sorted lists, and moves a partition boundary based on an element 
magnitude requirement and a partition size requirement. The new 
partitioned lists are then merged by the P processors, and simply strung 
together to provide a sorted list of all the elements. 
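
The overall flow of the abstract (split, sort, partition, merge, concatenate) can be illustrated with a short sequential sketch. It is only an approximation under stated assumptions: the function name `partitioned_sort` is hypothetical, the P processors are simulated by loops, and the row-by-row boundary search of the abstract is replaced by a simpler stand-in that reads splitter values off a merged view and cuts each sorted sublist with binary search.

```python
from bisect import bisect_right
from heapq import merge


def partitioned_sort(items, p):
    """Sort `items` by sorting p sublists, partitioning them, and merging."""
    n = len(items)
    if n == 0:
        return []
    # Divide the list into p sublists and sort each (one per processor).
    sublists = [sorted(items[i * n // p:(i + 1) * n // p]) for i in range(p)]
    # Stand-in for the boundary search: pick p-1 splitter values from a
    # merged view. This is NOT the row-based method of the abstract.
    merged_view = list(merge(*sublists))
    splitters = [merged_view[max(k * n // p - 1, 0)] for k in range(1, p)]
    # Cut every sorted sublist at each splitter value.
    cuts = [[0] + [bisect_right(s, v) for v in splitters] + [len(s)]
            for s in sublists]
    # Merge the k-th slice of every sublist, then string the results together.
    out = []
    for k in range(p):
        out.extend(merge(*(s[c[k]:c[k + 1]] for s, c in zip(sublists, cuts))))
    return out
```

Because every element of partition k is no larger than any element of partition k+1, the merged partitions can simply be concatenated, which is the property the abstract relies on.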
...CLAIMS consideration;

b) fixing a partition boundary near the middle row of elements;

c) determining the maximum value of all the elements under
consideration above the partition boundary;

d) determining the elements under consideration below the 
partition boundary that are less than the maximum value; 

e) moving elements about the boundary based on the size... 

...of elements which should be moved to make the partitions the correct 
size; and 

moving the number of elements to be moved to make the partitions 
the correct size minus the number of elements determined in step 
d from below the boundary to above the boundary. 
9. A method of sorting a list of N elements using P processors, where 

...comprising the steps of: 

a) dividing the list into P sublists of approximately N/P elements; 

b) each processor sorting one of the sublists; 

c) defining P-1 partition boundaries, each boundary being
defined by one of P-1 of the processors, said boundaries dividing
the lists into nearly equal partitions of elements having values less
than all of...

...the presorted lists, each processor comprising: 

means for selectively and iteratively adding elements from the 
lists to a partitioning list; 

means for selecting an initial partition boundary for the
partitioning list;

means for determining a size modifier based on the desired
number of elements above the partition boundary versus the actual
number of elements above the partition boundary;

means for determining a magnitude modifier based on the number 
of elements below the partition boundary which are less than the 
magnitude of the largest element above the partition boundary... 

...the presorted lists, each processor comprising: 

means for selectively and iteratively adding elements from the 
lists to a partitioning list; 

means for selecting an initial partition boundary for the 
partitioning list; 

means for determining a size modifier; 

means for determining a magnitude modifier; and 

means for modifying the partition boundary as a function of the 
size modifier and magnitude modifier following each iterative 
addition of...

Detailed Description

Claims 

Fulltext Word Count: 11884 
English Abstract 

A network-based storage system comprises one or more block-level storage
servers (104) that connect to, and provide disk storage for, one or more
host computers (102) over logical network connections (400) (preferably
TCP/IP sockets). In one embodiment, each host (102) can maintain one or more
socket connections (400) to each storage server (104), over which
multiple I/O operations may be performed concurrently in a non-blocking
manner. The physical storage of a storage server (104) may optionally be
divided into multiple partitions, each of which may be independently
assigned to a particular host (102) or to a group of hosts. Host driver
software (204) presents these partitions to user-level processes as one
or more local disk drives. When a host (102) initially connects to a
storage server (104) in one embodiment, the storage server (104)
initially authenticates the host, and then notifies the host (102) of the
ports that may be used to establish data connections (400) and of the 
partitions assigned to that host (102). 

French Abstract 

L'invention concerne un systeme de stockage base reseau, comprenant au
moins un serveur de stockage de niveau bloc (104), lequel se connecte a
un ou plusieurs ordinateurs hotes (102), sur des connexions reseau
logiques (400) (de preference des prises TCP/IP), et permet le stockage
disque pour ce ou ces ordinateurs. Dans un mode de realisation, chaque
hote (102) peut conserver une ou plusieurs connexions de prise (400) avec
chaque serveur de stockage (104), connexions par l'intermediaire
desquelles il est possible d'executer plusieurs operations d'E/S, de
maniere concurrente, sans blocage. Le stockage physique d'un serveur de
stockage (104) peut se decouper eventuellement en plusieurs partitions,
chacune pouvant etre assignee de maniere independante a un hote en
particulier (102) ou a un groupe d'hotes. Le logiciel de pilotage hote
(204) presente ces partitions aux procedes niveau utilisateur, sous forme
d'une ou plusieurs unites de disques locaux. Dans un mode de realisation,
lorsqu'un hote (102) se connecte d'abord a un serveur de stockage (104),
le serveur de stockage (104) authentifie d'abord l'hote (102), puis
indique a ce dernier les ports qui peuvent etre utilises aux fins
d'etablissement de connexions de donnees (400), ainsi que les partitions
assignees a cet hote (102).
Fulltext Availability:
Claims 

Claim 

provides functionality for allocating a partition to multiple host 
computers to permit sharing of partitions. 

42. The storage server system of Claim 38, wherein the partitions have a
user-definable size.

43. The storage server system of Claim 38, wherein the software system
supports the ability for a host computer to concurrently perform multiple
input/output operations over...
Claims

Fulltext Word Count: 19072 
English Abstract 

Methods and systems (1000) are provided for merging computer disk
partitions to reduce the number of partitions (1010). Unlike conventional
approaches that rely on FDISK, the invention does not destroy user data
on the disk (1008) during or after the two or more partitions are merged.
Two or more adjoining partitions may be combined. During a merging
operation, partitions may have their clusters aligned (612) or resized
(614). The merging partitions may also have their partition type changed
(606). During the merge at least one copy of all system and user data of
all partitions is kept on a disk at all times, reducing the risk of data
loss.

French Abstract 

L'invention concerne des procedes et des systemes (1000) destines a
fusionner des partitions de disque afin de reduire le nombre de
partitions (1010). Contrairement aux approches traditionnelles qui
reposent sur l'utilisation de FDISK, la presente invention ne detruit pas
les donnees utilisateurs du disque (1008) durant ou apres la fusion des
deux partitions ou plus. Deux partitions adjacentes ou plus peuvent etre
associees. Au cours de l'operation de fusion, les blocs des partitions
peuvent etre alignes (612) ou redimensionnes (614). Le type de partition
des partitions qui fusionnent est egalement modifie (606). Au cours de la
fusion, au moins une copie de toutes les donnees du systeme et des
donnees utilisateur de toutes les partitions sont conservees en
permanence sur un disque, ce qui reduit ainsi le risque de perte de
donnees.

Legal Status (Type, Date, Text)
Fulltext Availability:
Claims 

Claim 

... The method of claim 1, further comprising completing the merger of
each secondary partition into the target partition to produce the merged
partition, the merged partition having the determined cluster size
and the determined partition type, the method being performed without
destroying user data of the target partition except at user request and
without destroying user data of any secondary ... progress markers
corresponding to
incrementally increasing portions of the merged partition production.

32. The system of claim 27, wherein the system further comprises a cluster
size checker which determines whether a partition needs to have its
clusters resized, and the system further comprises a cluster resizer
which resizes those clusters of the partition which the cluster size
determiner has determined need to be resized.

33. The system of claim 27, wherein the system preserves at least one copy
of all system data of all merging...
Claims

Fulltext Word Count: 21408 
English Abstract 

A method is described for allowing a user to select one of a plurality of 
items. The user employs a device having a display area, and a joystick or 
a contact sensitive area. The device displays a number of regions equal 
to the number of items, and defines a number of sections in the angular 
range of the joystick, or sections within the contact sensitive area, 
equal to the number of items, and arranged corresponding to the 
arrangement of the regions of the display area. The user selects one of 
said items by selecting the corresponding section. 
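
As a rough illustration of the joystick case, the angular range can be divided into as many equal sections as there are items, and the deflection angle then picks the section. The function name `select_item`, the equal section widths, and the choice of measuring the angle counter-clockwise from the positive x axis are assumptions for this sketch; the abstract only requires that the sections correspond to the arrangement of the displayed regions.

```python
import math


def select_item(x, y, items):
    """Map a joystick deflection (x, y) to one of len(items) angular sections."""
    angle = math.atan2(y, x) % (2 * math.pi)  # normalise to the range 0..2*pi
    width = 2 * math.pi / len(items)          # equal sections cover the full range
    # min() guards against a rounding result equal to len(items).
    index = min(int(angle // width), len(items) - 1)
    return items[index]
```

With four items the sections are the four quadrants, so a deflection up and to the right selects the first item and one down and to the right selects the fourth.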

French Abstract 

L'invention a trait a un procede permettant a un utilisateur de
selectionner un article parmi plusieurs. L'utilisateur utilise un
dispositif dote d'une zone d'affichage et un module d'instruction ou une
zone sensible de contact. Le dispositif affiche un certain nombre de
regions egales au nombre d'articles, et definit un certain nombre de
sections dans la portee angulaire dudit module, ou de sections a
l'interieur de la zone sensible de contact egales au nombre d'articles et
disposees de maniere a correspondre a la disposition des regions de la
zone d'affichage. L'utilisateur selectionne un desdits articles en
choisissant la section correspondante.

Legal Status (Type, Date, Text) 

Publication 20010301 A2 Without international search report and to be
republished upon receipt of that report.
Search Rpt 20010621 Late publication of international search report
Republication 20010621 A3 With international search report.
Search Rpt 20010621 Late publication of international search report
Examination 20010726 Request for preliminary examination prior to end of
19th month from priority date
Claim Mod 20011115 Later publication of amended claims under Article 19
received: 20010521
Republication 20011115 A3 With international search report.
Republication 20011115 A3 With amended claims.

Fulltext Availability: 

Claims 
Claim 

one of said
sections.

4. A method according to claim 3 in which the sections
collectively cover the whole of the angular range, so
that defining the sections is equivalent to partitioning
the angular range.

5. A method according to any preceding claim in which
the user can (i) vary the selection of the item,
information being displayed in relation...
Claims

Fulltext Word Count: 45843 
English Abstract 

A computer system comprises a plurality of processing modules that can 
be configured into different partitions within the computer system, and a 
main memory. Each partition operates under the control of a separate 
operating system. At least one shared memory window is defined within the 
main memory to which multiple partitions have shared access, and each
partition may also be assigned an exclusive memory window. Program code
executing on different partitions enables those partitions to communicate 
with each other through the shared memory window. Means are also provided 
for mapping the physical address space of the processors in each 
partition to the respective exclusive memory windows assigned to each 
partition, so that the exclusive memory windows assigned to each 
partition appear to the respective operating systems executing on those 
partitions as if they all start at the same base address. 
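
The address mapping in the last sentence can be pictured with a small sketch: each partition's operating system uses addresses that start at a common base, and a per-partition offset relocates them into that partition's exclusive window. The names `Window` and `to_physical`, the base address of 0, and the example window addresses are hypothetical; the abstract does not give concrete values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Window:
    start: int  # physical start address of the window in main memory
    size: int   # window length in bytes


def to_physical(window, os_address, base=0):
    """Relocate an address as seen by a partition's OS into its exclusive window."""
    offset = os_address - base
    if not 0 <= offset < window.size:
        raise ValueError("address outside the partition's exclusive window")
    return window.start + offset
```

Two partitions can then issue the same OS-level address and still reach disjoint regions of main memory, because only the window start differs.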

French Abstract 

L'invention concerne un systeme informatique qui comprend une pluralite
de modules de traitement que l'on peut configurer en differentes
partitions dans le systeme informatique, et une memoire principale.
Chaque partition fonctionne sous la commande d'un systeme d'exploitation
separe. Au moins une fenetre de memoire partagee est definie dans la
memoire principale a laquelle plusieurs partitions ont un acces partage,
et chaque partition peut aussi se faire attribuer une fenetre de memoire
exclusive. L'execution d'un code programme dans differentes partitions
permet a ces partitions de communiquer entre elles par la fenetre de
memoire partagee. Cette invention concerne aussi des moyens permettant de
projeter l'espace d'adresses physiques des processeurs dans chaque
partition dans les fenetres de memoire exclusives respectives attribuees
a chaque partition, de facon que les fenetres de memoire exclusives
attribuees a chaque partition semblent toutes partir de la meme adresse
de base pour les systemes d'exploitation respectifs qui s'executent dans
ces partitions.

Fulltext Availability: 
Claims 

Claim 
... DATA
1314
RELEASE SUB-PODs FROM RESET (IDENTIFY BIOS SUB-PODS (BSPs))
1316
INITIALIZE PCI BUSSES
1318
READ CONFIGURATION DATA TO IDENTIFY PARTITIONS (OPTIONAL)
1320
CALCULATE SIZE OF HIGH AND LOW MEMORY HOLES
1322
INFORM MANAGEMENT INTERFACE PROCESSOR (MIP) OF THE AMOUNT OF
MEMORY-MAPPED I/O SPACE REQUIRED BY PCI CARDS...

English Abstract

The invention is a storage device (1) for a host computer system. The
device (1) incorporates a Supervisor function for controlling access to
information stored in a storage medium (2) of the device. The main
embodiment described is a hard disk drive (1) comprising: one or more
disk platters (2) for storing information; a ROM (4) for storing firmware
for controlling operation of the drive; a volatile RAM (5); a
micro-controller (7) for controlling the transfer of information to and
from the disk platter(s) (2); and an interface (6) for interfacing the
drive (1) with the host computer system and via which information is
transferred to and from the disk platter(s) (2) under the control of the
micro-controller (7). A Supervisor is provided in the form of firmware
which is preferably stored in the ROM (4), the Supervisor operating the
micro-controller (7) so as to protect information stored on the disk
platter(s).

French Abstract 

La presente invention concerne un dispositif de stockage (1) destine a
un systeme informatique hote. Le dispositif de l'invention (1) comprend
une fonction de superviseur qui commande l'acces aux informations
stockees sur un support de stockage (2) du dispositif. Dans le mode de
realisation principal, le dispositif est compose d'une unite de disque
dur (1) comprenant: un ou plusieurs supports disques (2) destines a
stocker les informations; une ROM (4) destinee a stocker les logiciels
microprogrammes qui commandent le fonctionnement de l'unite; une RAM
volatile (5); un microcontroleur (7) qui commande le transfert des
informations depuis et vers le(s) support(s) disque(s) (2); et une
interface (6) qui assure l'interfacage entre l'unite de disque (1) et le
systeme informatique hote et via laquelle les informations sont
transferees vers et depuis le(s) support(s) disque(s) (2) sous le
controle du microcontroleur (7). Un programme superviseur, se presentant
sous la forme d'un logiciel microprogramme stocke dans la ROM (4), assure
le fonctionnement du microcontroleur (7) de facon a proteger les
informations stockees sur le(s) support(s) disque(s) (2).

Fulltext Availability:
Claims 

Claim 

... is held in said volatile RAM means
(5) of the storage device (1), and each entry in a said SRT is
a pointer which defines the address of a range of sectors in
the WMR partition that have been updated and an address where
the updated information is located, this location being within
a dedicated area on the storage medium (2...
Fulltext Word Count: 4903
English Abstract 

A system and method for dynamically resizing queues (1023) used in a 
network switch (210) to accommodate potential congestion situations 
without experiencing data loss. In one embodiment, partition pointer
registers (410) are used to indicate when resizing is desirable. The 
control logic (405) then determines when it is safe to update the size of 
the queue such that no data loss occurs and timely updates the queue 
size. 
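
One way to picture the deferred boundary update is a pending partition pointer that the control logic applies only once no queued data would be lost. The sketch below is a simplified model under assumptions: the class name `SharedQueueMemory` is hypothetical, only two queues share the memory, and "safe" is approximated by an occupancy check rather than by inspecting read and write pointer positions.

```python
class SharedQueueMemory:
    """Two queues sharing `size` slots, split at a movable partition boundary."""

    def __init__(self, size, boundary):
        self.size = size
        self.boundary = boundary  # current partition pointer
        self.pending = None       # requested new boundary, if any
        self.used = [0, 0]        # occupied slots in queue 0 and queue 1

    def capacity(self, q, boundary=None):
        b = self.boundary if boundary is None else boundary
        return b if q == 0 else self.size - b

    def enqueue(self, q):
        if self.used[q] >= self.capacity(q):
            return False          # queue full under the current boundary
        self.used[q] += 1
        return True

    def dequeue(self, q):
        if self.used[q] == 0:
            return False
        self.used[q] -= 1
        self.try_resize()         # draining may make a pending move safe
        return True

    def request_resize(self, new_boundary):
        self.pending = new_boundary
        return self.try_resize()

    def try_resize(self):
        # Simplified safety rule: no queue may hold more than its new capacity.
        if self.pending is None:
            return False
        if all(self.used[q] <= self.capacity(q, self.pending) for q in (0, 1)):
            self.boundary, self.pending = self.pending, None
            return True
        return False
```

A resize request that would shrink a queue below its current occupancy stays pending and takes effect on the dequeue that makes it safe.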

French Abstract 

L'invention concerne un systeme et un procede servant a redimensionner
de facon dynamique des files d'attente (1023) dans un centre de
commutation (210) de reseau, afin de faire face a des situations
potentielles de congestion sans pertes de donnees. Dans un mode de
realisation, des registres (410) de pointeurs de partition sont utilises
pour indiquer quand un redimensionnement est souhaitable. La logique
(405) de commande determine ensuite le moment le plus sur pour mettre a
jour la dimension de la file d'attente afin qu'il n'y ait aucune perte de
donnees, et effectue la mise a jour de la file d'attente au bon moment.

Fulltext Availability: 

Claims 
Claim 

one queue;
for each partition boundary to be updated;
checking states of the at least one queue affected by movement
of the location of the partition boundary;
determining when it is safe to move each location of the
partition boundary; and
when it is determined that it is safe to move the location of the
partition boundary, updating the partition boundary to the updated
location.

2. The method...

...not located in the area affected by movement of the partition boundary.
5. An apparatus comprising:
a memory comprising at least one queue, each queue defined by
partition boundaries;
at least one partition pointer, each partition pointer identifying a
location of a partition boundary; and
control logic coupled to the memory and the at least one partition
pointer... to indicate an updated location of the partition
boundary;
checking states of at least one queue affected by movement of the
location of the partition boundary;
determining when it is safe to move the location of the partition
boundary; and
when it is determined that it is safe to move the location of the
partition boundary, updating the partition boundary to the updated

Statistical information collection apparatus e.g. for population, for
business applications, determines proposed site for shop erection based 
on statistical data corresponding to each partition on area map
Abstract (Basic): JP 2003167881 A

NOVELTY - A display unit displays an electronic map of a designated 
area, which is divided into several partitions. The databases (2-6) 
store statistical data including population, number of public 
institutions, etc., corresponding to each partitioned area of the 
map. A determination unit determines a proposed site for erecting a 
shop based on the statistical data corresponding to each partitioned 
area on the map. 

USE - Used in business applications for collecting statistical 
information such as number of public institutions such as schools, 
number of residents and population composition in designated area, for 
determining proposed site for erecting shops. 

ADVANTAGE - Automatically performs area analysis to determine a 
suitable erection site efficiently. 

DESCRIPTION OF DRAWING(S) - The figure shows the block diagram of
the statistical data collection system. (Drawing includes non-English
language text).

information process (1)
statistical information databases (2-6)

Database administration and replication method involves storing
statistics for each of database sampled records to perform
extrapolated replication partition analysis operation on database
Abstract (Basic): US 20030004973 A1

NOVELTY - The database records are randomly sampled using a
random sampling facility (26) which is integrated within a database
management system (14). The statistics for each of the sampled
records are stored, based on which an extrapolated replication
partition analysis operation on the database is performed.

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for
database management system.

USE - For administration and replication of database storing
business setting information, individual and corporate accounts, etc.

ADVANTAGE - Approximation partition analysis is performed 
without straining or otherwise compromising computer system resources. 
The integrated sampling facility reduces number of system calls 
required for performing the analysis and also enables rapid access to 
records being retrieved. 

DESCRIPTION OF DRAWING(S) - The figure shows a block diagram of
computer system.

database management system (14)
random sampling facility (26)

Database partition boundary determination method in information
system, involves sampling records of database using random number
algorithms, which are added or deleted from database
Abstract (Basic): US 20030004944 A1

NOVELTY - A particular number defining a desired sample size is
selectively received to provide a seed value for initializing a random
number algorithm. The records of a database (10) which are randomly
sampled using the algorithm, are added or deleted from the database.
Statistics for each record including a record key is stored to
produce an approximation partition analysis.
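
A seeded random sample of record keys can be turned into approximate partition boundaries along the following lines. This is only a sketch under assumptions: the function name `approximate_boundaries` and the quantile rule for choosing boundaries are illustrative, and the record does not say how the stored statistics are converted into boundaries.

```python
import random


def approximate_boundaries(record_keys, sample_size, partitions, seed):
    """Estimate partition boundary keys from a seeded random sample of keys."""
    rng = random.Random(seed)  # the seed value initialises the generator
    sample = sorted(rng.sample(record_keys, sample_size))
    # Boundary k is the sampled key at the k/partitions quantile of the sample.
    return [sample[k * sample_size // partitions] for k in range(1, partitions)]
```

Reusing the same seed reproduces the same sample and therefore the same boundaries, while a sample that covers every record gives the exact quantiles. The result is an estimate, which matches the record's remark that the analysis is not mathematically exact.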

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for 
database partition boundary determination system. 

USE - For databases in information system for business 
application. 

ADVANTAGE - Enables obtaining accurate analysis for dynamically
changing databases even though approximation partition analysis
is not mathematically exact.

DESCRIPTION OF DRAWING(S) - The figure shows the block diagram of
the computer system.
database (10)
pp; 10 DwgNo 1/3

Characterizing S by the identification of textual and physical structured
query fragments, useful for the analysis of textual and biopolymer 
information
Abstract (Basic): US 20020177138 A1

NOVELTY - Characterizing (M1) a set of strings (S) comprising
receiving S with process-pattern containing substrings, defining series 
of search target S patterns effective for searching S and processing 
through an ordered series of search steps, each step being specific for 
one search class and an attempted discovery of an appropriate search 
target site to define a delimited search region for the next step, 
thereby characterizing S, is new. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for: 

(1) analyzing (M2) a set of polynucleotides (PN's), comprising
identifying electronic structured query fragments using M and isolating 
physical structured query fragments, where the isolating comprises 
providing the set of PN's and isolating physical structured query 
fragments within the set of PN's by isolating fragments that remain 
after processing the set of PN's through a series of step-wise 
delimitation processes comprising cleaving the set of PN's with a 
cleavage effector to form a set of PN fragments including target PN 
fragments, and retaining only the target PN fragments for a next 
preemptive cleavage according to each recognition site pattern of the 
series of recognition site patterns, and comparing the electronic 
structured query fragments to the physical structured query fragments, 
thereby analyzing the set of PN's; 

(2) isolating and characterizing (M3) a set of PN's using M2; 

(3) characterizing (M4) using M1 and defining a process for
identifying the process-pattern containing substrings based on a 
selected arrangement of search targets within a search target string 
pattern, and performing the process to identify the process-pattern 
containing substrings within the S for each search target pattern in 
the series of search target patterns, thereby characterizing the S; 

(4) characterizing (M5) sets of strings, comprising receiving one 
or more sets of strings of any length, where may be found occurrences
of relatively short search-target-strings of interest, and where one or
more of the short search-target-strings are used to define a distinct 
search target, and where several distinct search targets or targets are 
assembled into structured entities known as search target groups, where 
a search target group is comprised of a partition search target that is 
used to partition the sets of strings under study into substrings or 
partition fragments bounded by consecutive occurrences of the partition 
search target, and a small array of a limited number M of major classes 
or ordered sets of search targets, where each major class is comprised 
of a limited number of ranked member search targets, and where a search 
target group of target group, of two or more search target groups or 
target groups of distinct composition of structure, may be used to 
characterize search target group-defined substrings found within the 
sets of strings under study, using the structure and composition of a 
search target group with M major classes to define a search process 
comprised of a series of M search steps that are to be effected within 
each of the partition fragments obtained, from the sets of strings 
under study, using the partition search target of the target group, and 
where the search process defines patterns, of occurrence within the 
partition fragments of search targets that are members of the target 
group, and where partition fragments or regions therein may be 
characterized by the occurrence of instances, of the process patterns 
that may be defined by the structure and composition of the target 
group, and using the structure and composition of a search target group 
with M major classes to effect a search process comprised of a series 
of M search steps within each of the partition fragments obtained, from 
the sets of strings under study, using the partition search target of 
the target group, and where the search process results in the detection 
of process-pattern entities, where each process-pattern entity is 
comprised of a pattern of M search target sites, which together include 
a search target site representing one member of each of the M major 
classes in the target group, and where each of the sites must be 
present and where sites representing higher-ranked members of the same 
major class must be absent within the relevant search area for the 
major class in the partition fragment, and where the process pattern 
entities are obtained as a result of a stepwise search and delimitation 
process after each site is found that restricts the region of the 
partition fragment where the next class-specific target-search occurs, 
and where partition fragments or regions therein may be characterized 
by the occurrence therein of process-pattern entities, where the 
process-pattern entities represent instances of the process-patterns 
that may be defined by the structure and composition of the target 
group, and where partition fragments or regions therein may be 
characterized by the occurrence therein of structured query fragments 
(SQFs) that are fragments bounded by any two search target sites in a 
process-pattern entity, and whose lengths can be calculated from the 
positions of the constituent sites that comprise the process-pattern 
entity wherein the SQFs are found, and where the SQFs of particular 
interest are typically the SQFs bounded by the last two search target 
sites detected in the identification of a process-pattern entity; and 

(5) physical characterization (M6) of a sample of PN's of the 
same general type. 
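
The search process of item (4) can be illustrated with a minimal sketch. It is not the claimed method: the function names, the example strings, and the assumption that each found site restricts the next search to the region to its right are all hypothetical choices made for illustration. Each major class is a list of member targets ordered from highest rank to lowest, so taking the first member that occurs means all higher-ranked members are absent from the search area.

```python
def partition_fragments(s, partition_target):
    """Return (start, end) spans of s bounded by consecutive partition-target sites."""
    sites = []
    pos = s.find(partition_target)
    while pos != -1:
        sites.append(pos)
        # Assumption: partition sites do not overlap one another.
        pos = s.find(partition_target, pos + len(partition_target))
    return [(a + len(partition_target), b) for a, b in zip(sites, sites[1:])]


def find_process_pattern(s, span, major_classes):
    """Run one search step per major class; return [(target, position), ...] or None."""
    start, end = span
    sites = []
    for members in major_classes:  # members are listed highest rank first
        hit = None
        for target in members:
            idx = s.find(target, start, end)
            if idx != -1:
                hit = (target, idx)  # higher-ranked members were absent
                break
        if hit is None:
            return None  # every one of the M sites must be present
        sites.append(hit)
        # Delimitation step (assumed direction): search only after this site.
        start = hit[1] + len(hit[0])
    return sites


def last_sqf_length(sites):
    """Length of the fragment bounded by the last two sites of a process-pattern entity."""
    (prev_target, prev_pos), (_, last_pos) = sites[-2], sites[-1]
    return last_pos - (prev_pos + len(prev_target))
```

With the hypothetical string "GAATTCxxAAGyyCCTzzGAATTC", partition target "GAATTC" and major classes [["AAG", "AAC"], ["CCT"]], the single partition fragment yields one process-pattern entity whose last SQF is the two-character stretch between "AAG" and "CCT".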

USE - M1 to M6 are useful for identifying, classifying, comparing, 
generating and/or separating fragments derived from one or more 
physical samples of PN's. They can also be used in computational and 
laboratory methods and databases for analyzing textual and biological 
sequence information. 

Management device for partition table of database has addition section 
which replaces existing partition of partition table of database based 
on extracted partition definition information 

Abstract (Basic): JP 2002041333 A 

NOVELTY - A definition information extraction section obtains the 
partition definition information used in a new partition to be 
replaced and added to the existing partition of the partition table of 
a database. An addition section replaces the existing partition of 
the partition table of the database with the new partition based on 
the extracted partition definition information. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for a 
database partition table management method. 

USE - For partition table of database 

ADVANTAGE - Enables automatic replacement and addition of new 
partition to existing partition of partition table of database. 
Maintains improvement of a disc in input-output performance by 
maintaining distribution condition of partition, thereby preventing 
data overflow of a predetermined area. 

DESCRIPTION OF DRAWING(S) - The figure shows the sample of script 
production of a partition addition. (Drawing includes non-English 
language text). 

Restoration of database in a computer, involves applying modifications in 
log file to copied objects, including table index and partition index, 
during one pass through log file 

Abstract (Basic): US 6119128 A 

NOVELTY - The method involves copying objects, including the table 
index and partitioning index, from the secondary data storage device to 
the database on the primary data storage device after receiving a 
recovery indicator. Modifications in the log file are applied to the 
copied objects, including the table index and partitioning index, 
during one pass through the log file. 

DETAILED DESCRIPTION - The method begins by copying different 
objects in the database from the primary data storage device to the 
secondary storage device, in which the table index is used to locate 
data in a table while the partitioning index defines the scope of 
each partition and assigns a row of the table to respective 
partition. Modifications to the objects are logged in the log file. The 
recovery indicator shows the required recovery of objects in the 
database. INDEPENDENT CLAIMS are also included for the following: 

(a) the restoration apparatus used on the database of a computer; 

(b) and the manufacture of the computer program carrier used in 
database restoration. 

USE - Used in computer-implemented database systems and in 
recovering different types of objects with one pass of the log. 

ADVANTAGE - Provides recovery for partitions, partitioning indexes 
and table indexes simultaneously. Requires only one pass of log file to 
apply modifications to database. 

DESCRIPTION OF DRAWING(S) - The figure shows the recovery system 
for database in computer. 

Provision method for continuous database service and scalable query 
performance uses active redundant copies, redundancy group and several 
computing system partitions, each group manages database schema 
replication in partitions 

Abstract (Basic): EP 992909 A2 

NOVELTY - The method has several computing systems connected 
together via a network, each comprising one or more computing system 
partitions. A redundancy group has a computing system and several 
computing system partitions, with each redundancy group managing the 
replication of the database schema within the computing system 
and computing system partitions within the redundancy group. 

USE - For the provision of continuous database service and scalable 
query performance using active redundant copies. 

ADVANTAGE - Provides a system with reasonable development costs and 
implementation schedules that does not sacrifice the benefits of open 
systems. 

DESCRIPTION OF DRAWING(S) - The drawing shows a block diagram of 
the hardware environment that could be used. 

Replicated object management method for hierarchical network database - 
involves determining object IDs of target and its parent objects, and 
combining object IDs to form database-wide object ID 

Patent Assignee: NOVELL INC (NOVE-N) 

Inventor: IZATT L; OLDS D R; PRASAD R 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 
US 958671 P 19951215 
Patent Details: 
US 5832487 A 15 G06F-017/30 CIP of application US 94357466. 

Provisional application US 958671 
CIP of patent US 5608903 

Abstract (Basic) : US 5832487 A 

The method involves obtaining a replica ID (102) which identifies 
the replica relative to other replicas in the database. The replica 
ID and an integer value (104) are used to form a partition-wide object 
ID for the target object. 

The integer value is calculated from an event counter value (108), 
pseudo-random value (110), time stamp value (112), GUID value (114). 
The partition-wide object ID is determined for each parent object 
of the target object. The parent and child object IDs are combined to form 
the database-wide object ID. 
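
The ID composition described in this abstract can be sketched as follows. This is only an illustration under assumptions: the class and function names are hypothetical, IDs are modelled as plain tuples, and "each parent object" is read as every ancestor of the target object.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple


@dataclass(frozen=True)
class DirObject:
    name: str                      # may change without affecting any ID
    replica_id: int                # identifies the replica that created the object
    integer_value: int             # e.g. an event counter or time stamp value
    parent: Optional["DirObject"] = None


def partition_wide_id(obj: DirObject) -> Tuple[int, int]:
    """Combine the replica ID and the integer value into one ID."""
    return (obj.replica_id, obj.integer_value)


def database_wide_id(obj: DirObject) -> Tuple[Tuple[int, int], ...]:
    """Concatenate the partition-wide IDs of all ancestors and the object itself."""
    ids = []
    node: Optional[DirObject] = obj
    while node is not None:
        ids.append(partition_wide_id(node))
        node = node.parent
    return tuple(reversed(ids))    # root first, target object last
```

Because the name is not part of either ID, renaming an object (for example with `dataclasses.replace(obj, name="new")`) leaves its database-wide ID unchanged, which matches the advantage stated for this record.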

USE - In distributed digital network. 

ADVANTAGE - Unique identifier of database is not updated during 
updating of object name. Does not allow distinct objects to have same 
IDs. 

Abstract (Basic): US 6341289 B1 

NOVELTY - The computer system (100) has a computer program stored 
in a memory (120) and executed by a processor (110). The computer 
program includes a schema mapper for mapping between object attributes 
and fields in a database table. The schema mapper defines the source 
of a partitioning key value and the partitioning key field in the 
database table for storing the partitioning key value. 

DETAILED DESCRIPTION - The partitioning key value identifies the 
partition containing the object within a class of objects. The 
partition also defines the subclass of objects within the class. 
INDEPENDENT CLAIMS are also included for the following: 

(a) the computer program; 

(b) and the mapping method between objects and database table 
used to persistently store objects. 

USE - For object identity and partitioning for user defined 
extents. 

ADVANTAGE - Allows transparent and flexible partitioning of created 
objects. Allows queries to be performed against partition without 
requiring user to have any specific knowledge of the partitioning 
structure. Provides customization and extension quality of framework 
mechanisms that are valuable to framework consumers because the cost of 
customizing or extending a framework is much less than the cost of 
replacing or reworking an existing solution. Allows maximum flexibility 
in application development and deployment. 

DESCRIPTION OF DRAWING (S) - The figure shows the schematic view of 
the computer system. 

Computer system (100) 

Processor (110) 

Memory (120) 

Repartitioning of data stored in direct access storage device connected 
to computer, involves reorganizing identified partitions based on altered 
partitioning scheme by moving data between identified partitions 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC) 

Inventor: COURTER D K; DESAI P S; FRISKE C A; GARTH J M; GOUGHERTY K D; 
KUNIOKA-WEIS L M; RAIMAN D E; RUDDY J A; WATTS J A; ZEIDENSTEIN K R 

Abstract (Basic): US 6125370 A 

NOVELTY - The adjacent partitions of data that would be affected by 
the altered partitioning scheme, are identified. Access to each 
identified partition is restricted, without restricting access to other 
partitions. The identified partitions are reorganized based on the 
altered partitioning scheme, by moving data between the identified 
partitions, while allowing access to other partitions. 

DETAILED DESCRIPTION - The change in partitioning scheme for data 
is detected during alteration of partitioning index specifying one or 
more partitions and a limit key for each partition. The limit key 
defines a range of values for the partition. The adjacent 
partitions affected by the altered partitioning scheme are identified 
and reorganized. An INDEPENDENT CLAIM is also included for data 
repartitioning apparatus. 
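
As a rough illustration of the limit-key idea in this abstract, the sketch below finds the partitions touched by a changed limit key and moves rows only among them. The names are hypothetical, each limit key is assumed to be the inclusive upper bound of its partition, and the access restriction is not modelled; the patent's actual procedure is not reproduced here.

```python
from bisect import bisect_left


def affected_partitions(old_limits, new_limits):
    """Indexes of partitions whose key range changes when limit keys are altered."""
    if len(old_limits) != len(new_limits):
        raise ValueError("this sketch only handles altered limit keys, not added partitions")
    affected = set()
    for i, (old, new) in enumerate(zip(old_limits, new_limits)):
        if old != new:
            affected.add(i)              # its upper bound moved
            if i + 1 < len(old_limits):
                affected.add(i + 1)      # the adjacent partition's lower bound moved
    return sorted(affected)


def rebalance(partitions, new_limits, affected):
    """Redistribute keys among the affected partitions only, in place."""
    moved = [key for i in affected for key in partitions[i]]
    for i in affected:
        partitions[i] = []
    for key in moved:
        # The first limit key >= key identifies the owning partition.
        partitions[bisect_left(new_limits, key)].append(key)
```

For example, raising the second limit key from 200 to 250 in [100, 200, 300, 400] touches only the second and third partitions; the first and fourth are never read or written by `rebalance`.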

USE - For data repartitioning in relational databases stored in 
direct access storage devices such as hard disk drive, tape drive, 
floppy disk drive connected to computer. 

ADVANTAGE - Because the rebalancing of data is limited to the 
affected partition, the repartitioning system provides a technique for 
rebalancing a subset of partitions without restricting access to 
unaffected partitions. Enables shifting of data among partitions based 
on the changed partitioning scheme, reliably. 

DESCRIPTION OF DRAWING(S) - The figure shows the flow diagram 
illustrating the process sequence involved in data repartitioning 
method. 

Information handling system for multiprocessor database - has workfile 
disks which are logically partitioned into multiple groups and shared by 
logical processors which separately execute mergesort operation 

Patent Assignee: INT BUSINESS MACHINES CORP (IBMC) 

Inventor: WU K; YU P S 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 
Patent Details: 
US 5671405 A 7 G06F-007/06 

Abstract (Basic): US 5671405 A 

The system includes several logical processors each of which 
operates under control of an image of an operating system program. A 
database storage system stores data in a predetermined data structure 
and one or more workfile storage disks store workfiles during mergesort 
operations. One or more workfile storage disks are shared by one or 
more logical processors. Mergesort operations are executed on separate 
logical processors. 

The execution involves sorting the data structure into one or more 
ordered runs and determining a logical partition size of the 
workfile storage disks. A least loaded partition is selected and one or 
more ordered runs are written into the workfile storage disks in the 
selected partition. The ordered runs are merged into a single sorted 
run. 
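
The execution steps above (sort into runs, pick the least loaded workfile partition, write the runs, merge) might look like the following in-memory sketch. The function names are hypothetical, workfile partitions are modelled as a dictionary of lists rather than disks, and load is assumed to mean the number of records already stored.

```python
import heapq


def sort_into_runs(records, run_size):
    """Split the input into chunks of run_size and sort each chunk into an ordered run."""
    return [sorted(records[i:i + run_size]) for i in range(0, len(records), run_size)]


def least_loaded(load):
    """Return the name of the workfile partition holding the fewest records."""
    return min(load, key=load.get)


def mergesort_with_workfiles(records, run_size, workfiles):
    """workfiles maps a partition name to the list of runs it currently stores."""
    load = {name: sum(len(run) for run in runs) for name, runs in workfiles.items()}
    target = least_loaded(load)
    runs = sort_into_runs(records, run_size)
    workfiles[target].extend(runs)           # write the runs to the chosen partition
    return target, list(heapq.merge(*runs))  # merge the runs into one sorted run
```

Each logical processor would call `mergesort_with_workfiles` for its own sort, so concurrent sorts tend to spread their runs over different workfile partitions.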

ADVANTAGE - Processes concurrent mergesorts efficiently. Allows 
dynamically choosing a less loaded partition to achieve benefits of load 
balancing. 

Dwg. 3/3 

Title Terms: INFORMATION; HANDLE; SYSTEM; MULTIPROCESSOR; DATABASE; DISC; 
LOGIC; PARTITION; MULTIPLE; GROUP; SHARE; LOGIC; PROCESSOR; SEPARATE; 
EXECUTE; OPERATE 
Derwent Class: T01 

International Patent Class (Main): G06F-007/06 
File Segment: EPI 



15/5/7 (Item 7 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 
(c) 2003 Thomson Derwent. All rts. reserv. 

011159315 **Image available** 

WPI Acc No: 1997-137240/199713 

Still-picture registration processor for still-picture database 
applications - in which image registering part registers information on 
intersection of each block based on representation colour 

Abstract (Basic): JP 9016614 A 

The still-picture registration processor divides the still-picture 
data (2) into several blocks separated by boundary lines, using a 
block partitioning part (3). A border-line calculating part (4) 
computes an outline-representing point, which is obtained as the 
intersection of the boundary line and border line of an image, for 
every boundary-line of each block. When the outline-representing point 
is obtained at two or more places, a border line simplifying part (5) 
selects the representation indication of two points from the 
outline-representing part. 

A colour calculating part (6) divides each block into two areas 
using linear approximation which connects the intersections and 
computes the allowed colour specification for each area. An image 
registering part (7) registers the information (8) on the intersection 
of each block and on each representation colour. 

ADVANTAGE - Simplifies display of still-picture data. Enables easy 
distinction of data. Improves image characteristics. Enables easy 
digital signal processing and provision of hardware. Improves 
operativity. Enables high speed processing. 

Buffering packets in digital communications system in order to fairly 
distribute unused buffer space between connections and traffic flow 
groups 

Abstract (Basic): CA 2273291 A1 

NOVELTY - A hierarchy of memory partitions is defined, where each 
partition consists of child partitions. The size of the top 
level partitions is set, whilst the nominal partition size for 
the child partitions is dynamically computed based on the 
congestion of each given child memory partition. The final step is 
iterated until all the partition sizes have been set. 

USE - For digital communications system e.g. ATM network. 

ADVANTAGE - Fairly distributes unused buffer space between 
connections and traffic flow groups. 

DESCRIPTION OF DRAWING (S) - The drawing shows a schematic diagram 
of the memory hierarchy in the buffer. 

Optimized runtime communication processing in co-/multisimulation 
environment, involves limiting synchronizations between solvers to 
situation in which simulation is performed based on provided event 
information 

Abstract (Basic): US 6108494 A 

NOVELTY - The optimized direction information of boundary nets 
(425) that satisfy partitioning rules is determined. The solvers 
are synchronized based on determined optimized direction information, 
in response to event information provided by first solver. The 
synchronizations between first and other solvers are limited to 
situations in which simulation is performed by each solver depending 
on event information. 

DETAILED DESCRIPTION - The design source (415) is read upon which 
first simulator operates. The design source defines several cells 
representing a design of the system or a portion of it. Two or more 
instances of a cell are identified, where a respective subset of instances 
containing one instance but not all of the cell's instances is assigned 
to a predetermined solver based upon the set of partitioning rules. 
The edited design source corresponding to a partition to be simulated 
by the first solver and netlist information in format understandable by 
second solver are generated. The edited design source includes modified 
cell description of a parent cell with which the cell is associated. 
The directions associated with boundary nets are accumulated based on 
corresponding netlist information. Each boundary net has a direction 
associated with each design partition it connects. An INDEPENDENT CLAIM 
is also included for optimized runtime communication processing 
program. 

USE - For use in co-/multisimulation environment and electronic 
design automation. 

ADVANTAGE - Since the synchronizations between solvers are limited 
to situations in which simulation is performed depending on event 
information from another solver, the runtime is optimized by avoiding 
unnecessary synchronizations during the simulation session. Increases 
runtime performance of co-/multisimulation environment by reducing 
number of connections and traffic between simulators. 

DESCRIPTION OF DRAWING (S) - The figure shows the overview of N-way 
co-/simulation process. 

Design source (415) 

Boundary net (425) 

Document partitioning fractionation method in digital computer for 
nonhierarchical, linear-time partitioning of corpus of documents by 
determining partitioning of desired size from ordering 

Abstract (Basic): EP 980043 A2 

NOVELTY - The method involves preparing an ordering of a corpus by 
determining a partitioning of a desired size from the ordering, 
and the partitioning is further refined. 

DETAILED DESCRIPTION - Rather than attempting to express this 
information need as a formal query, the user instead selects a number 
of the top-level clusters (22A-I) that, from their description, seem 
relevant to the topic of interest. In this case, the user selects the 
clusters (22A, 22C, 22H) labeled 'military history', 'science and 
industry', and 'American society' to form a reduced corpus (24) of the 
indicated subset of articles from Grolier's. In the example the cluster 
labels are idealized. 

USE - In a document-clustering-based browsing procedure for a 
corpus of documents, which is applicable over all natural languages 
that contain a lexical analysis capability. 

ADVANTAGE - Transforms the geometric structure into the logical 
structure, which represents the semantics carried by the documents. A 
virtual field separator technique is employed to utilize information 
carried by a special constituent of documents such as field separators 
and frames, keeping the number of transformation rules small. 

DESCRIPTION OF DRAWING (S) - The drawing is an illustrative diagram 
of preferred embodiment of the Scatter-Gather document browsing method 
according to an embodiment of the present invention. 

reduced corpus (24) 

top-level clusters (22A-I) 

Abstract (Basic): US 5974567 A 

NOVELTY - A user partition is created. A ghost partition 
overlapping the user partition is created. Diagnostic software is 
transferred to ghost partition. The ghost partition is disabled after 
determining that ghost partition is not required. 



DETAILED DESCRIPTION - Ghost partition is adapted to contain 
diagnostic software and download verification software. Maximum and 
minimum partition size is determined for the ghost partition 
and ghost partition size is allocated appropriately. An INDEPENDENT 
CLAIM is also included for partition creation and deletion program 
storage device. 

USE - In data storage device with diagnostic system e.g. 
multiplatter disk drive. 

ADVANTAGE - Avoids need to set up large system partition and avoids 
wasting disk space associated with manufacturing diagnostic, as no 
master boot records for any of user or system partition is modified 
during processing. 

DESCRIPTION OF DRAWING (S) - The figure shows the flow chart 
illustrating partition creation and deletion method. 

pp; 20 DwgNo 7/8 

Title Terms: PARTITION; CREATION; DELETE; METHOD; DATA; STORAGE; DEVICE 
Derwent Class: T01 

International Patent Class (Main) : G06F-011/22 
International Patent Class (Additional) : G06F-011/267 
File Segment: EPI 



17/5/15 (Item 13 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 
(c) 2003 Thomson Derwent. All rts. reserv. 

011466556 **Image available** 

WPI Acc No: 1997-444473/199741 

XRPX Acc No: N97-370100 

Automatic size calculation method for partition members in rooms - 
involves computing size of partition member by selecting one among 
basic specification stored in microcomputer, corresponding to measured 
size of room space 

Patent Assignee: KOMANI KK (KOMA-N) 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

JP 9203225 A 19970805 JP 9637123 A 19960130 199741 B 

Priority Applications (No Type Date) : JP 9637123 A 19960130 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
JP 9203225 A 3 E04H-001/00 

Abstract (Basic) : JP 9203225 A 

The method uses a laser type distance measuring equipment (1) which 
is connected to a microcomputer (2). The size of room space is measured 
using a laser beam irradiated from the distance measuring equipment. A 
number of partition member basic specifications are stored in the 
microcomputer. 

The size of a partition member is computed by selecting one 
among the number of stored basic specifications corresponding to 
measured size of room space. 

ADVANTAGE - Enables to deduce size of each partition member 
automatically based on size of room. Shortens measurement time. 
Avoids generation of mistake in distance measurement work. 

Distributed application creation for object oriented environment - 
involves displaying application layout for user manipulation of objects 
with internal representation of objects and boundaries allowing creation 
of server code structure 

Abstract (Basic): EP 780763 A 

The object oriented application creation method involves displaying 
application layout illustrating object parts and links. The application 
is defined internally. At least one partition boundary is 
displayed and represented internally in response to user action. 

At least one program object is relocated so that its connections 
with other objects cross at least one partition boundary, and 
the connections are defined as distributed in the internal connection. 
Client and server objects are determined from the distributed 
connections. In response to a user commit action server code structure 
is generated with a distributed interface for each server. A client 
stub is generated with the distributed interface for each client part 
corresponding to each server. 

ADVANTAGE - Allows effective utilisation of network resources. 